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1 PVS 

Method for stable chromosomal multi-copy integration of gen s 

Field of the Invention 

The invention relates to a method for inserting genes into the 
5 chromosome of bacterial strains, and the resulting strains. In the biotech 
industry it is desirable to construct polypeptide production strains having 
several copies of a gene of interest stably chromosomal ly integrated, 
without leaving antibiotic resistance marker genes in the strains. 



10 Background of the Invention 

In the industrial production of polypeptides it is of interest to 
achieve a product yield as high as possible. One way to increase the yield 
is to increase the copy number of a gene encoding a polypeptide of interest. 
This can be done by placing the gene on a high copy number plasmid, however 

15 plasmids are unstable and are often lost from the host cells if there is no 
selective pressure during the cultivation of the host cells. Another way to 
increase the copy number of the gene of interest is to integrate it into the 
host cell chromosome in multiple copies. It has previously been described 
how to integrate a gene into the chromosome by double homologous 

20 recombination without using antibiotic markers (Hone et al . , Microbial 
Pathogenesis, 1988, 5: 407-418); integration of two genes has also been 
described (Novo Nordisk: WO 91/09129 and WO 94/14968) . A problem with 
integrating several copies of a gene into the chromosome of a host cell is 
instability. Due to the sequence identity of the copies there is a high 

25 tendency for the them to recombine out of the chromosome again during 

cultivation of the host cell unless a selective marker or other essential 
DNA is included between the copies and selective pressure is applied during 
cultivation, especially if the genes are located in relative close vicinity 
of each other. It has been described how to integrate two genes closely 

30 spaced in anti-parallel tandem to achieve better stability (Novo Nordisk: WO 
99/41358) . 

The present debate concerning the industrial use of recombinant DNA 
technology has raised some questions and concern about the use of antibiotic 
marker genes. Antibiotic marker genes are traditionally used as a means to 

35 select for strains carrying multiple copies of both the marker genes and an 
accompanying expression cassette coding for a polypeptide of industrial 
interest. In order to comply with the current demand for recombinant 
production host strains devoid of antibiotic markers, we have looked for 
possible alternatives to the present technology that will allow substitution 

40 of the antibiotic markers we use today with non-antibiotic marker genes. 

Thus in order to provide recombinant production strains devoid of antibiotic 
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resistance markers, it remains of industrial interest to find new methods to 
stably integrate genes in multiple copies into host cell chromosomes. 

Summary of the Invention 

5 The present invention solves the problem of integrating multiple 

copies of a gene of interest by homologous recombination into well defined 
chromosomal positions of a bacterial host strain which already comprises at 
least one copy of the gene of interest in a different position. This is done 
by making a deletion of part of a conditionally essential gene (hereafter 

10 called the "integration gene") in the host chromosome (or by integrating a 
partial gene into the host chromosome) of a strain which already comprises 
at least one copy of a gene of interest, so that the resulting strain has a 
deficiency, or a growth requirement, or is sensitive to a given stress. The 
next (i.e. second or third etc.) copy of the gene of interest is then 

15 introduced on a vector, on which the gene is flanked upstream by a partial 
fragment of the integration gene, and downstream is flanked by a fragment 
homologous to a DNA sequence downstream of the integration gene on the host 
chromosome. Thus, neither host chromosome nor the incoming vector contain a 
full version of the integration gene. In a non- limiting example the host 

20 chromosome may comprise the first two thirds of the integration gene and the 
vector the last two thirds, effectively establishing a sequence overlap of 
one third of the integration gene on the vector and the chromosome. 

Expression of the full version of the integration gene will only 
occur if homologous recombination between vector and host chromosome takes 

25 place via the partial integration gene sequences, and this particular 
recombination event can be efficiently selected for, even against the 
background of homologous integration into the chromosome directed by the 
gene of interest into the identical gene(s) comprised on the chromosome 
already* 

30 This strategy will enable directed gene integration by homologous 

recombination at predetermined loci, even though extended homology exists 
between the gene of interest on the incoming vector and other copies of this 
gene at other locations in the chromosome, and even though it is not 
feasible to identify the desired integrants based on the qualitative 

35 phenotype resulting from expression of the gene of interest, as this gene is 
already present in one or more copies in the host . 

In a non- limiting example herein a Bacillus enzyme production strain 
is provided that comprise two anti -parallel copies (inverted orientation) of 
a gene encoding the commercially available amylase Termamyl* (Novo Nordisk, 

40 Denmark) . A gene homologous to the dal gene of Bacillus subtilis, encoding a 
D-alanine racemase, was identified in the Bacillus production strain, it was 
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sequenced and a partial deletion was made in the dal gene of the Bacillus 
two-copy Termamyl* strain. A vector was constructed to effect a stable non- 
tandem chromosomal insertion of a third Termamyl* gene copy adjacent to the 
dal locus, in the process effectively restoring the complete dal gene, 
5 according to the above strategy. 

In another non-limiting example herein, an additional copy of the 
amylase encoding gene was introduced into the xylose isomerase operon of the 
Bacillus enzyme production strain which already comprised at least two 
copies of the amylase gene located elsewhere on the chromosome. 

10 Also in a non- limiting example we demonstrate the method of the 

invention by integrating an additional amylase- encoding gene into the 
gluconat operon of the Bacillus enzyme production strain. 

Accordingly in a first aspect the invention relates to a method for 
constructing a cell comprising at least two copies of a gene of interest 

15 stably integrated into the chromosome in different positions, the method 
comprising the steps of: 

a) providing a host cell comprising at least one chromosomal copy of the 
gene of interest; 

b) altering a conditionally essential chromosomal gene(s) of the host cell 
20 whereby the gene becomes non-funt ional ; 

c) making a DNA construct comprising: 

i) an altered non- functional copy of the chromosomal gene(s) of 
step b) ; and 

ii) at least one copy of the gene of interest flanked on one side by 
25 i) and on the other side by a DNA fragment homologous to a host cell 

DNA sequence adjacent to the gene(s) of step b) ; wherein a first 
recombination between the altered copy of i) and the altered 
chromomosomal gene(s) of step b) restores the chromosomal gene(s) to 
functionality and renders the cell selectable; 
30 d) introducing the DNA construct into the host cell and cultivating the 

cell under conditions that require a functional genets) of step b) ; and 

e) selecting a host cell that grows under the conditions of step d) ; 
wherein the at least one copy of the gene of interest has integrated 
into the host cell chromosome adjacent to the gene(s) of step b) ; and 

35 optionally 

f) repeating steps a) to e) at least once using a different chromosomal 
gene(s) in step b) in each repeat. 

Herein genetic tools are also described in the form of DNA constructs 
necessary for carrying out the method of the invention. 
40 Consequently in a second aspect the invention relates to a DNA 

construct comprising: 
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i) an altered non- functional copy of a conditionally essential 
chromosomal gene(s) from a host cell, preferably the copy is partially 
deleted; and 

ii) at least one copy of a gene of interest flanked on one side by i) 

5 and on the other side by a DNA fragment homologous to a host cell DNA 

sequence located on the host cell chromosome adjacent to the 
conditionally essential gene(s) of i) . 

The present invention provides a method for obtaining a host cell 
10 comprising at least two copies of a gene of interest stably integrated on 
the chromosome adjacent to conditionally essential loci. 

Accordingly in a third aspect the invention relates to a host cell 
comprising at least two copies of a gene of interest stably integrated into 
the chromosome, where each copy is integrated adjacent to different 
15 conditionally essential loci, wherein the cell is obtainable by any of the 
methods defined in claims 1-18. 

The method of the invention relies on complementing a conditionally 
essential gene(s) that was rendered non- functional , and a number of suitable 
host cells comprising such non- functional genes are described herein. To 
20 carry out multiple rounds of gene integration according to the invention it 
is advantageous to provide a host cell comprising several non- functional 
conditionally essential genes. 

In a fourth aspect the invention relates to a Bacillus licheniformis 
cell, wherein at least two conditionally essential genes are rendered non- 
25 functional, preferably the genes are chosen from the group consisting of 
xylR, xylA, galE, gntR, gntK, gntP, gntZ, and dal . 

Any host cell as described herein for use in a method of the 
invention is intended to be encompassed by the scope of the invention. 

Consequently in a fifth aspect the invention relates to the use of a 
30 cell as defined in the previous aspect in a method as defined in the first 
aspect . 

As mentioned above, genetic tools of the invention are described 
herein, and it is intended that the scope of the invention comprises such 
constructs when present in or propagated in host cells as is common in the 
35 art . 

Yet another aspect of the invention relates to a cell comprising a 
DNA construct as defined in the second aspect. 

in a final aspect the invention relates to a process for producing an 
enzyme of interest, comprising cultivating a cell as defined in any of the 
40 preceding aspects under conditions appropriate for producing the enzyme, and 
optionally purifying the enzyme. 
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Figures 

Figure 1: Schematic representation of the B. licheniformis xylose 
isomerase region, 
5 PCR fragments, Deletion and Integration plasmids and strains. 

Figure 2: Schematic representation of the B. licheniformis gluconat 

region, 

PCR fragments, Deletion and Integration plasmids and strains. 

Figure 3: Schematic representation of the B. licheniformis D-alanine 
10 racemase encoding region, PCR fragments, Deletion and Integration plasmids 
and strains. 

Definitions 

In accordance with the present invention there may be employed 

15 conventional molecular biology, microbiology, and recombinant DNA techniques 
within the skill of the art. Such techniques are explained fully in the 
literature. See, e.g., Sambrook, Fritsch & Maniatis, Molecular Cloning: A 
Laboratory Manual, Second Edition (1989) Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York (herein "Sambrook et al . , 1989") DNA 

20 Cloning: A Practical Approach, Volumes I and II /D.N. Glover ed. 1985) ; 

Oligonucleotide Synthesis (M.J. Gait ed. 1984) ; Nucleic Acid Hybridization 
(B.D. Hames & S.J. Higgins eds (1985)); Transcription And Translation {B.D. 
Hames & S.J. Higgins, eds. (1984)); Animal Cell Culture (R.I. Freshney, ed. 
(1986)); Immobilized Cells And Enzymes (IRL Press, (1986)); B. Perbal, A 

25 Practical Guide To Molecular Cloning (1984) . 

A "polynucleotide" is a single- or double- stranded polymer of 
deoxyribonucleotide or ribonucleotide bases read from the 5' to the 3' end. 
Polynucleotides include RNA and DNA, and may be isolated from natural 
sources, synthesized in vitro, or prepared from a combination of natural and 

30 synthetic molecules. 

A "nucleic acid molecule" or "nucleotide sequence" refers to the 
phosphate ester polymeric form of ribonucleosides (adenosine, guanosine, 
uridine or cytidine; "RNA molecules") or deoxyribonucleosides 
(deoxyadenosine, deoxyguanosine, deoxy thymidine , or deoxycyt idine ; "DNA 

35 molecules") in either single stranded form, or a double -stranded helix. 

Double stranded DNA -DNA, DNA -RNA and RNA -RNA helices are possible. The term 
nucleic acid molecule, and in particular DNA or RNA molecule, refers only to 
the primary and secondary structure of the molecule, and does not limit it 
to any particular tertiary or quaternary forms. Thus, this term includes 

40 double- stranded DNA found, inter alia, in linear or circular DNA molecules 
(e.g., restriction fragments), plasmids, and chromosomes. In discussing the 
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structure of particular double-stranded DNA molecules, sequences may be 
described herein according to the normal convention of giving only the 
sequence in the 5' to 3' direction along the nontranscribed strand of DNA 
(i.e., the strand having a sequence homologous to the mRNA) . A "recombinant 
5 DNA molecule" is a DNA molecule that has undergone a molecular biological 
manipulation . 

A DNA "coding sequence" or an "open reading frame (ORF) " is a double* 
stranded DNA sequence which is transcribed and translated into a polypeptide 
in a cell in vitro or in vivo when placed under the control of appropriate 

10 regulatory sequences. The ORF "encodes" the polypeptide. The boundaries of 
the coding sequence are determined by a start codon at the 5' (amino) 
terminus and a translation stop codon at the 3' (carboxyl) terminus. A 
coding sequence can include, but is not limited to, prokaryotic sequences, 
cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g., 

15 mammalian) DNA, and even synthetic DNA sequences. If the coding sequence is 
intended for expression in a eukaryotic cell, a polyadenylation signal and 
transcription termination sequence will usually be located 3' to the coding 
sequence . 

An expression vector is a DNA molecule, linear or circular, that 

20 comprises a segment encoding a polypeptide of interest operably linked to 
additional segments that provide for its transcription. Such additional 
segments may include promoter and terminator sequences, and optionally one 
or more origins of replication, one or more selectable markers, an enhancer, 
a polyadenylation signal, and the like. Expression vectors are generally 

25 derived from plasmid or viral DNA, or may contain elements of both. 

Transcriptional and translational control sequences are DNA regulatory 
sequences, such as promoters, enhancers, terminators, and the like, that 
provide for the expression of a coding sequence in a host cell. In 
eukaryotic cells, polyadenylation signals are control sequences. 

30 A "secretory signal sequence" is a DNA sequence that encodes a 

polypeptide (a "secretory peptide" that, as a component of a larger 
polypeptide, directs the larger polypeptide through a secretory pathway of a 
cell in which it is synthesized. The larger polypeptide is commonly cleaved 
to remove the secretory peptide during transit through the secretory 

35 pathway. 

The term "promoter" is used herein for its art -recognized meaning to 
denote a portion of a gene containing DNA sequences that provide for the 
binding of RNA polymerase and initiation of transcription. Promoter 
sequences are commonly, but not always, found in the 5' non- coding regions 
40 of genes . 



10022.000-DK 



7 

A chromosomal gene is rendered "non- functional" if the polypeptide 
that the gene encodes can no longer be expressed in a functional form. Such 
non- functionality of a gene can be induced by a wide variety of genetic 
manipulations as known in the art, some of which are described in Sambrook 
5 et al, vide supra. Partial deletions within the ORF of a gene will often 
render the gene non- functional , as will mutations. 

^Operably linked", when referring to DNA segments, indicates that the 
segments are arranged so that they function in concert for their intended 
purposes, e.g. transcription initiates in the promoter and proceeds through 
10 the coding segment to the terminator. 

A coding sequence is "under the control" of transcriptional and 
translational control sequences in a cell when RNA polymerase transcribes 
the coding sequence into mRNA, which is then trans -RNA spliced and 
translated into the protein encoded by the coding sequence. 
15 "Heterologous" DNA refers to DNA not naturally located in the cell, or 

in a chromosomal site of the cell. Preferably, the heterologous DNA includes 
a gene foreign to the cell . 

As used herein the term "nucleic acid construct" is intended to 
indicate any nucleic acid molecule of cDNA, genomic DNA, synthetic DNA or 
20 RNA origin. The term "construct" is intended to indicate a nucleic acid 

segment which may be single- or double- stranded, and which may be based on a 
complete or partial naturally occurring nucleotide sequence encoding a 
polypeptide of interest. The construct may optionally contain other nucleic 
acid segments. 

25 The nucleic acid construct of the invention encoding the polypeptide 

of the invention may suitably be of genomic or cDNA origin, for instance 
obtained by preparing a genomic or cDNA library and screening for DNA 
sequences coding for all or part of the polypeptide by hybridization using 
synthetic oligonucleotide probes in accordance with standard techniques (cf. 

30 Sambrook et al . , supra) . 

The nucleic acid construct of the invention encoding the polypeptide 
may also be prepared synthetically by established standard methods, e.g. the 
phosphoamidite method described by Beaucage and Caruthers, Tetrahedron 
Letters 22 (1981), 1859 - 1869, or the method described by Matthes et al . , 

35 EMBO Journal 3 (1984), 801 - 805. According to the phosphoamidite method, 
oligonucleotides are synthesized, e.g. in an automatic DNA synthesizer, 
purified, annealed, ligated and cloned in suitable vectors. 

Furthermore, the nucleic acid construct may be of mixed synthetic and 
genomic, mixed synthetic and cDNA or mixed genomic and cDNA origin prepared 

40 by ligating fragments of synthetic, genomic or cDNA origin (as appropriate) , 
the fragments corresponding to various parts of the entire nucleic acid 
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construct, in accordance with standard techniques. The nucleic acid 
construct may also be prepared by polymerase chain reaction using specific 
primers, for instance as described in US 4,683,202 or Saiki et al . , Science 
239 (1988) , 487 - 491. 
5 The term nucleic acid construct may be synonymous with the term 

^expression cassette" when the nucleic acid construct contains the control 
sequences necessary for expression of a coding sequence of the present 
invention 

The term "control sequences" is defined herein to include all 

10 components that are necessary or advantageous for expression of the coding 
sequence of the nucleic acid sequence. Each control sequence may be native 
or foreign to the nucleic acid sequence encoding the polypeptide. Such 
control sequences include, but are not limited to, a leader, a 
polyadenylation sequence, a propeptide sequence, a promoter, a signal 

15 sequence, and a transcription terminator. At a minimum, the control 

sequences include a promoter, and transcriptional and translational stop 
signals. The control sequences may be provided with linkers for the purpose 
of introducing specific restriction sites facilitating ligation of the 
control sequences with the coding region of the nucleic acid sequence 

20 encoding a polypeptide. 

The control sequence may be an appropriate promoter sequence, a 
nucleic acid sequence that is recognized by a host cell for expression of 
the nucleic acid sequence. The promoter sequence contains transcription and 
translation control sequences that mediate the expression of the 

25 polypeptide. The promoter may be any nucleic acid sequence that shows 

transcriptional activity in the host cell of choice and may be obtained from 
genes encoding extracellular or intracellular polypeptides either homologous 
or heterologous to the host cell . 

The control sequence may also be a suitable transcription terminator 
30 sequence, a sequence recognized by a host cell to terminate transcription. 
The terminator sequence is operably linked to the 3' terminus of the nucleic 
acid sequence encoding the polypeptide. Any terminator which is functional 
in the host cell of choice may be used in the present invention. 
The control sequence may also be a polyadenylation sequence, a sequence 
35 which is operably linked to the 3' terminus of the nucleic acid sequence and 
which, when transcribed, is recognized by the host cell as a signal to add 
polyadenosine residues to transcribed mRNA. Any polyadenylation sequence 
which is functional in the host cell of choice may be used in the present 
invention. 

40 The control sequence may also be a signal peptide -coding region, which 

codes for an amino acid sequence linked to the amino terminus of the 
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polypeptide which can direct the expressed polypeptide into the cell's 
secretory pathway of the host cell. The 5' end of the coding sequence of 
the nucleic acid sequence may inherently contain a signal peptide- coding 
region naturally linked in translation reading frame with the segment of the 
5 coding region which encodes the secreted polypeptide. Alternatively, the 5' 
end of the coding sequence may contain a signal peptide -coding region which 
is foreign to that portion of the coding sequence which encodes the secreted 
polypeptide. A foreign signal peptide- coding region may be required where 
the coding sequence does not normally contain a signal peptide -coding 

10 region. Alternatively, the foreign signal peptide coding region may simply 
replace the natural signal peptide coding region in order to obtain enhanced 
secretion of the polypeptide relative to the natural signal peptide coding 
region normally associated with the coding sequence. The signal peptide- 
coding region may be obtained from a glucoamylase or an amylase gene from an 

15 Aspergillus species, a lipase or proteinase gene from a Rhizomucor species, 
the gene for the alpha-factor from Saccharomyces cerevisiae, an amylase or a 
protease gene from a Bacillus species, or the calf preprochymosin gene. 
However, any signal peptide coding region capable of directing the expressed 
polypeptide into the secretory pathway of a host cell of choice may be used 

20 in the present invention. 

The control sequence may also be a propeptide coding region, which 
codes for an amino acid sequence positioned at the amino terminus of a 
polypeptide. The resultant polypeptide is known as a proenzyme or 
propolypeptide {or a zymogen in some cases) . A propolypeptide is generally 

25 inactive and can be converted to mature active polypeptide by catalytic or 
autocatalytic cleavage of the propeptide from the propolypeptide. The 
propeptide coding region may be obtained from the Bacillus subtilis alkaline 
protease gene (aprE) , the Bacillus subtilis neutral protease gene {nprT) , 
the Saccharomyces cerevisiae alpha-factor gene, or the Myceliophthora 

30 therrnophilum laccase gene (WO 95/33836) . 

It may also be desirable to add regulatory sequences which allow the 
regulation of the expression of the polypeptide relative to the growth of 
the host cell. Examples of regulatory systems are those which cause the 
expression of the gene to be turned on or off in response to a chemical or 

35 physical stimulus, including the presence of a regulatory compound. 

Regulatory systems in prokaryotic systems would include the lac, tac, and 
trp operator systems. Other examples of regulatory sequences are those which 
allow for gene amplification. In eukaryotic systems, these include the 
dihydrofolate reductase gene which is amplified in the presence of 

40 methotrexate, and the metallothionein genes which are amplified with heavy 
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metals. In these cases, the nucleic acid sequence encoding the polypeptide 
would be placed in tandem with the regulatory sequence. 

Examples of suitable promoters for directing the transcription of the 
nucleic acid constructs of the present invention, especially in a bacterial 
5 host cell, are the promoters obtained from the E, coli lac operon, the 
Streptomyces coelicolor agarase gene (dagA) , the Bacillus subtilis 
levansucrase gene {sacB) , the Bacillus subtilis alkaline protease gene, the 
Bacillus licheniformis alpha-amylase gene (amyL) , the Bacillus 
stearothermophilus maltogenic amylase gene {amyM) , the Bacillus 

10 amyloliquefaciens alpha-amylase gene {amyQ) , the Bacillus amyloliquefaciens 
BAN AMYLASE GENE, the Bacillus licheniformis penicillinase gene (penP) , the 
Bacillus subtilis xylA and xylB genes, and the prokaryotic beta-lactamase 
gene {Villa-Kamarof f et al . , 1978, Proceedings of the National Academy of 
Sciences USA 75:3727-3731), as well as the tac promoter (DeBoer et al . , 

15 1983, Proceedings of the National Academy of Sciences USA 80:21-25) . Further 
promoters are described in "Useful proteins from recombinant bacteria" in 
Scientific American, 1980, 242:74-94; and in Sambrook et a 1 . , 1989, supra. 

The present invention also relates to recombinant expression vectors 
comprising a nucleic acid sequence of the present invention, a promoter, and 

20 transcriptional and translational stop signals. The various nucleic acid 
and control sequences described above may be joined together to produce a 
recombinant expression vector which may include one or more convenient 
restriction sites to allow for insertion or substitution of the nucleic acid 
sequence encoding the polypeptide at such sites. Alternatively, the nucleic 

25 acid sequence of the present invention may be expressed by inserting the 
nucleic acid sequence or a nucleic acid construct comprising the sequence 
into an appropriate vector for expression. In creating the expression 
vector, the coding sequence is located in the vector so that the coding 
sequence is operably linked with the appropriate control sequences for 

30 expression, and possibly secretion. 

The recombinant expression vector may be any vector (e.g., a plasmid 
or virus) which can be conveniently subjected to recombinant DNA procedures 
and can bring about the expression of the nucleic acid sequence. The choice 
of the vector will typically depend on the compatibility of the vector with 

35 the host cell into which the vector is to be introduced. The vectors may be 
linear or closed circular plasmids. The vector may be an autonomously 
replicating vector, i.e., a vector which exists as an extra chromosomal 
entity, the replication of which is independent of chromosomal replication, 
e.g., a plasmid, an extrachromosomal element, a mini chromosome, or an 

40 artificial chromosome. The vector may contain any means for assuring self- 
replication. Alternatively, the vector may be one which, when introduced 
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into the host cell, is integrated into the genome and replicated together 
with the chromosome (s) into which it has been integrated. The vector system 
may be a single vector or plasmid or two or more vectors or plasmids which 
together contain the total DNA to be introduced into the genome of the host 
5 cell, or a transposon. 

The vectors of the present invention preferably contain one or more 
^selectable markers" which permit easy selection of transformed cells. A 
selectable marker is a gene the product of which provides for biocide, 
antibiotic or viral resistance, resistance to heavy metals, prototrophy to 

10 auxotrophs, and the like. 

A "conditionally essential gene" may function as a n non-ant ibiotic 
selectable marker" . Non-limiting examples of bacterial conditionally 
essential selectable markers are the dal genes from Bacillus subtilis or 
Bacillus lichen! formis, that are only essential when the bacterium is 

15 cultivated in the absence of D-alanine. Also the genes encoding enzymes 
involved in the turnover of UDP-galactose can function as conditionally 
essential markers in a cell when the cell is grown in the presence of 
galactose or grown in a medium which gives rise to the presence of 
galactose. Non-limiting examples of such genes are those from B. subtilis or 

20 B. licheniformis encoding UTP-dependent phosphorylase (EC 2.7.7.10), UDP- 
glucose -dependent uridylyltransf erase (EC 2.7.7.12), or UDP-galactose 
epimerase (EC 5.1.3.2). Also the xylose isomerase genes, xylR and xylA t of 
Bacilli can be used as selectable markers in cells grown in minimal medium 
with xylose as sole carbon source. The genes necessary for utilizing 

25 gluconate, gntR, gntK, gntP, and gntZ can also be used as selectable markers 
in cells grown in minimal medium with gluconate as sole carbon source 

Antibiotic selectable markers confer antibiotic resistance to such 
antibiotics as ampicillin, kanamycin, chloramphenicol, tetracycline, 
neomycin, hygromycin or methotrexate. 

30 Furthermore, selection may be accomplished by co- transformation, 

e.g. , as described in WO 91/17243, where the selectable marker is on a 
separate vector . 

The vectors of the present invention preferably contain an element (s) 
that permits stable integration of the vector, or of a smaller part of the 
35 vector, into the host cell genome or autonomous replication of the vector in 
the cell independent of the genome of the cell. 

The vectors, or smaller parts of the vectors such as amplification 
units of the present invention, may be integrated into the host cell genome 
when introduced into a host cell. For chromosomal integration, the vector 
40 may rely on the nucleic acid sequence encoding the polypeptide or any other 
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element of the vector for stable integration of the vector into the genome 
by homologous or nonhomologous recombination. 

Alternatively, the vector may contain additional nucleic acid 
sequences for directing integration by homologous recombination into the 
5 genome of the host cell. The additional nucleic acid sequences enable the 
vector to be integrated into the host cell genome at a precise location (s) 
in the chromosome (s) . To increase the likelihood of integration at a 
precise location, the integrat ional elements should preferably contain a 
sufficient number of nucleic acids, such as 100 to 1,500 base pairs, 

10 preferably 400 to 1,500 base pairs, and most preferably 800 to 1,500 base 
pairs, which are highly homologous with the corresponding target sequence to 
enhance the probability of homologous recombination. The integrat ional 
elements may be any sequence that is homologous with the target sequence in 
the genome of the host cell. Furthermore, the integrational elements may be 

15 non-encoding or encoding nucleic acid sequences. 

On the other hand, the vector may be integrated into the genome of the 
host cell by non- homologous recombination. These nucleic acid sequences may 
be any sequence that is homologous with a target sequence in the genome of 
the host cell, and, furthermore, may be non-encoding or encoding sequences. 

20 The copy number of a vector, an expression cassette, an amplification unit, 
a gene or indeed any defined nucleotide sequence is the number of identical 
copies that are present in a host cell at any time. A gene or another 
defined chromosomal nucleotide sequence may be present in one, two, or more 
copies on the chromosome. An autonomously replicating vector may be present 

25 in one, or several hundred copies per host cell. 

For autonomous replication, the vector may further comprise an origin 
of replication enabling the vector to replicate autonomously in the host 
cell in question. Examples of bacterial origins of replication are the 
origins of replication of plasmids pBR322, pUC19, pACYC177, pACYC184, 

30 pUBHO, pE194, pTA1060, and pAMfcl . The origin of replication may be one 
having a mutation which makes its functioning temperature -sensitive in the 
host cell (see, e.gr., Ehrlich, 1978, Proceedings of the National Academy of 
Sciences USA 75:1433). 

The present invention also relates to recombinant host cells, comprising a 
35 nucleic acid sequence of the invention, which are advantageously used in the 
recombinant production of the polypeptides. The term "host cell" 
encompasses any progeny of a parent cell which is not identical to the 
parent cell due to mutations that occur during replication. 

The cell is preferably transformed with a vector comprising a nucleic 
40 acid sequence of the invention followed by integration of the vector into 
the host chromosome. "Transformation" means introducing a vector comprising 
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a nucleic acid sequence of the present invention into a host cell so that 
the vector is maintained as a chromosomal integrant or as a self -replicating 
extra-chromosomal vector. Integration is generally considered to be an 
advantage as the nucleic acid sequence is more likely to be stably 
5 maintained in the cell. Integration of the vector into the host chromosome 
may occur by homologous or non -homologous recombination as described above. 

The choice of a host cell will to a large extent depend upon the gene 
encoding the polypeptide and its source. The host cell may be a unicellular 
microorganism, e.g., a prokaryote, or a non-unicellular microorganism, e.g., 

10 a eukaryote. Useful unicellular cells are bacterial cells such as gram 
positive bacteria including, but not limited to, a Bacillus cell, e.g., 
Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus 
circulans. Bacillus coagulans, Bacillus lautus. Bacillus lentus, Bacillus 
licheniformis, Bacillus megaterium, Bacillus stearothermophilus , Bacillus 

15 subtilis, and Bacillus thuringiensis; or a Streptomyces cell, e.g., 

Streptomyces lividans or Strep tomyces murinus, or gram negative bacteria 
such as E. coli and Pseudomonas sp. In a preferred embodiment, the 
bacterial host cell is a Bacillus lentus, Bacillus licheniformis, Bacillus 
stearothermophilus or Bacillus subtilis cell. 

20 The transformation of a bacterial host cell may, for instance, be 

effected by protoplast transformation (see, e.g., Chang and Cohen, 1979, 
Molecular General Genetics 168:111-115), by using competent cells (see, 
e.g., Young and Spizizin, 1961, Journal of Bacteriology 81:823-829, or 
Dubnar and Davidoff- Abel son, 1971, Journal of Molecular Biology 56:209-221), 

25 by electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechnigues 

6:742-751), or by conjugation (see, e.g., Koehler and Thome, 1987, Journal 
of Bacteriology 169:5771-5278). 

The transformed or transfected host cells described above are cultured 
in a suitable nutrient medium under conditions permitting the expression of 

30 the desired polypeptide, after which the resulting polypeptide is recovered 
from the cells, or the culture broth. 

The medium used to culture the cells may be any conventional medium suitable 
for growing the host cells, such as minimal or complex media containing 
appropriate supplements. Suitable media are available from commercial 

35 suppliers or may be prepared according to published recipes (e.g. in 

catalogues of the American Type Culture Collection) . The media are prepared 
using procedures known in the art (see, e.g., references for bacteria and 
yeast; Bennett, J.W. and LaSure, L . , editors, More Gene Manipulations in 
Fungi, Academic Press, CA, 1991) . 

40 If the polypeptide is secreted into the nutrient medium, the 

polypeptide can be recovered directly from the medium. If the polypeptide 
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is not secreted, it is recovered from cell lysates . The polypeptide are 
recovered from the culture medium by conventional procedures including 
separating the host cells from the medium by centrif ugation or filtration, 
precipitating the proteinaceous components of the supernatant or filtrate by 
5 means of a salt, e.g. ammonium sulphate, purification by a variety of 

chromatographic procedures, e.g. ion exchange chromatography, gelf iltration 
chromatography, affinity chromatography, or the like, dependent on the type 
of polypeptide in question. 

The polypeptides may be detected using methods known in the art that 

10 are specific for the polypeptides. These detection methods may include use 
of specific antibodies, formation of an enzyme product, or disappearance of 
an enzyme substrate. For example, an enzyme assay may be used to determine 
the activity of the polypeptide. 

The polypeptides of the present invention may be purified by a variety 

15 of procedures known in the art including, but not limited to, chromatography 
(e.g., ion exchange, affinity, hydrophobic, chromatof ocusing, and size 
exclusion), electrophoretic procedures (e.g., preparative isoelectric 
focusing (IEF) , differential solubility (e.g., ammonium sulfate 
precipitation), or extraction (see, e.g., Protein Purification, J.-C. Janson 

20 and Lars Ryden, editors, VCH Publishers, New York, 1989) . 



Detailed description of the invention 

A method for constructing a cell comprising at least two copies of a 
gene of interest stably integrated into the chromosome in different 
25 positions according to the first aspect of the invention. 

In the method of the invention it is envisioned that after the 
directed and selectable integration of the DNA construct into the chromosome 
of the host cell by the first homologous recombination, a second 
recombination can take place between a DNA fragment comprised in the 
30 construct and a homologous host cell DNA sequence located adjecent to the 
gene(s) of step b) of the method of the first aspect, where the DNA fragment 
of the construct is homologous to said host cell DNA sequence. 

Accordingly a preferred embodiment of the invention relates to the 
method of the first aspect, wherein subsequent to step d) and prior to step 
35 e) a second recombination takes place between the DNA fragment and the 
homologous host cell DNA sequence. 

Further it is envisioned that one might add a marker gene to the DNA 
construct, which could ease selection of first recombination integrants, 
where the marker gene would be excised from the host cell chromosome again 
40 by the second recombination as described above. 
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In a preferred embodiment the invention relates to the method of the 
first aspect, where the DNA construct further comprises a marker gene which 
is located in the construct such that it is recombined out of the chromosome 
by the second recombination; preferably 
5 the marker gene confers resistance to an antibiotic, more preferably the 
antibiotic is chosen from the group consisting of chloramphenicol, 
kanamycin, ampillin, erythromycin, spectinomycin and tetracycline; and most 
preferably a host cell is selected which grows under the conditions of step 
d) of the first aspect and which does not contain the marker gene in the 
10 chromosome. 

The method of the invention can also be carried out by including a 
marker gene in that part of the DNA construct which remains integrated in 
the chromosome after the second recombination event. However as it is 
preferred not to have marker genes in the chromosome, an alternative way of 

15 removing the marker gene must be employed after the integration has been 

carried out. Specific restriction enzymes or resolvases that excise portions 
of DNA, if it is flanked on both sides by certain recognition sequences 
known as resolvase sites or res-sites, are well known in the art. 

A preferred embodiment of the invention relates to the method of the 

20 first aspect, where the DNA construct further comprises a marker gene 
located between the altered copy and the DNA fragment, and wherein the 
marker gene is flanked by nucleotide sequences that are recognized by a 
specific resolvase, preferably the nucleotide sequences are res; even more 
preferably the marker gene is excised from the chromosome by the action of a 

25 resolvase enzyme after step e) and prior to step f) in the first aspect. 

The gene of interest may encode an enzyme that is naturally produced 
by the host cell, indeed one may simply want to increase the number of 
copies of a gene endogenous to the host cell. 

Accordingly a preferred embodiment of the invention relates to the 

30 method of the first aspect, wherein the gene of interest originates from the 
host cell. 

In another preferred embodiment the invention relates to the method 
of the first aspect, wherein the gene of interest encodes an enzyme, 
preferably an amylolytic enzyme, a lipolytic enzyme, a proteolytic enzyme, a 

35 cellulytic enzyme, an oxidoreductase or a plant cell-wall degrading enzyme, 
and more preferably an enzyme with an activity selected from the group 
consisting of aminopeptidase, amylase, amyloglucosidase, carbohydrase , 
carboxypeptidase, catalase, cellulase, chitinase, cutinase, cyclodextrin 
glycosyltransf erase, deoxyribonuc lease, esterase, galactosidase, beta- 

40 galactosidase, glucoamylase, glucose oxidase, glucosidase, haloperoxidase, 
hemicellulase, invertase, isomerase, laccase, ligase, lipase, lyase, 
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mannosidase, oxidase, pectinase, peroxidase, phytase, phenoloxidase, 
polyphenoloxidase, protease, ribonuclease, transferase, transglutaminase, or 
xylanase . 

As mentioned above, the gene of interest may be endogenous to the 
5 host cell, however it may even be more advantageous if the production cell 
obtained by the method of the invention contains no foreign DNA at all, when 
the integration procedure is completed. 

Yet a preferred embodiment of the invention relates to the method of 
the first aspect, wherein the host cell selected in step e) of the first 
10 aspect comprises DNA only of endogenous origin. 

Many ways exist in the art of rendering a gene non- functional by 
manipulation, such as partially deleting the gene or the promoter of the 
gene, or by introducing mutations in the gene or the promoter region of the 
gene . 

15 A preferred embodiment of the invention relates to the method of the 

first aspect, wherein the chromosomal gene(s) of the host cell is altered by 
partially deleting the gene, or by introducing mutations in the gene(s). 

The present invention relies on rendering a chromosomal gene(s) of 
the host cell non-f unctional in a step, and in particular relies on a number 

20 of conditionally essential genes to be rendered non -functional . The gene(s) 
may be rendered non-f unctional by a partial deletion or a mutation as known 
in the art; specifically the gene(s) may be rendered non- functional through 
the use of a "Deletion plasmid(s)" as shown herein in non-limiting examples 
below. For each of the preferred embodiments relating to the altered 

25 chromosomal gene(s) of step b) of the method of the first aspect, the most 
preferred embodiment is shown by non-limiting examples herein and reference 
is made to the genetic tools constructed for that purpose, such as the PCR 
primer sequences used for constructing the "Deletion plasmid(s)". 

Accordingly a preferred embodiment of the invention relates to the 

30 method of the first aspect, wherein the chromosomal gene(s) of the host cell 
that is altered in step b) encodes a D-alanine racemase, preferably the 
gene(s) is a da2 homologue from a Bacillus cell, more preferably the gene is 
homologous to dal from Bacillus subtilis, and most preferably the gene(s) is 
the dal gene of Bacillus licheniformis* 

35 Another preferred embodiment of the invention relates to the method 

of the first aspect, wherein the chromosomal gene(s) of the host cell that 
is altered in step b) encodes a D-alanine racemase and is at least 75% 
identical, preferably 80% identical, or preferably 85% identical, more 
preferably 90% identical, or more preferably 95% and most preferably at 

40 least 97% identical to the dal sequence of Bacillus licheniformis shown in 
positions 1303 to 2469 in SEQ ID 12. 
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A preferred embodiment of the invention relates to the method of the 
first aspect, wherein the chromosomal gene(s) of the host cell that is 
altered in step b) is one or more genes of a xylose isomerase operon, 
preferably the gene{s) is homologous to the xylR gene or the xylA gene from 
5 Bacillus subtilis, more preferably the gene(s) is homologous to both xylR 
and xylA, and most preferably the gene(s) is homologous to one or more genes 
of the xylose isomerase operon of Bacillus licheniformis. 

A preferred embodiment of the invention relates to the method of the 
first aspect, wherein the chromosomal gene(s) of the host cell that is 

10 altered in step b) encodes a galactokinase (EC 2.7.1.6), an UTP-dependent 
pyrophosphorylase (EC 2.7.7.10), an UDP- glucose -dependent 
uridylyltransferase (EC 2.7.7.12), or an UDP-galactose epimerase (EC 
5.1.2.3), preferably the gene(s) encodes an UDP-galactose epimerase (EC 
5.1.2.3), more preferably the gene(s) is homologous to galE of a Bacillus, 

15 and most preferably the gene is galE of Bacillus licheniformis. 

A preferred embodiment of the invention relates to the method of the 
first aspect, wherein the chromosomal gene(s) of the host cell that is 
altered in step b) is one or more genes of a gluconate operon, preferably 
the gene(s) encodes a gluconate kinase (EC 2.7.1.12) or a gluconate permease 

20 or both, more preferably the gene(s) is one or more genes homologous to 

genes from Bacillus subtilis chosen from the group consisting of gntR, gntK, 
gntP, and gntZ, and most preferably the gene(s) is one or more genes of 
gntR, gntK, gntP, and gntZ from Bacillus licheniformis. 

As described herein the method of the invention is very relevant for 

25 the biotech industry and a number of preferred organisms are very well known 
in this industry, especially Gram positive host cells, and certainly host 
cells of the Bacillus genus, specifically Bacillus alkalophilus, Bacillus 
amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, 
Bacillus coagulans, Bacillus lautus, Bacillus lentus t Bacillus 

30 licheniformis, Bacillus megaterium, Bacillus stearothermophilus , Bacillus 
subtilis, and Bacillus thuringiensis . 

A preferred embodiment of the invention relates to the method of the 
first aspect, wherein the host cell is a Gram-positive bacterial cell, 
preferably a Bacillus cell, and most preferably a Bacillus cell chosen from 

35 the group consisting of Bacillus alkalophilus, Bacillus amyloliquefaciens, 
Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, 
Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus 
megaterium, Bacillus stearothermophilus , Bacillus subtilis, and Bacillus 
thuringi ensi s . 

40 Another preferred embodiment of the invention relates to the method 

of the first aspect, wherein the DNA construct is a plasmid. 
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As described elsewhere herein, the present invention provides genetic 
tools for carrying out the method of the invention, such as host cells, and 
DNA constructs of the invention, such as a DNA construct of the second 
aspect comprising: 
5 i) an altered non- functional copy of a conditionally essential 

chromosomal gene(s) from a host cell, preferably the copy is partially 
deleted; and 

ii) at least one copy of a gene of interest flanked on one side by i) 
and on the other side by a DNA fragment homologous to a host cell DNA 

10 sequence located on the host cell chromosome adjecent to the 

conditionally essential gene(s) of i) . 
A preferred embodiment of the invention relates to the DNA construct 
of the second aspect, wherein the chromosomal gene(s) of the host cell that 
is altered in i) encodes a D-alanine racemase, preferably the gene(s) is a 

15 dal homologue from a Bacillus cell, more preferably the gene is homologous 
to dal from Bacillus subtilis, and most preferably the gene is the dal gene 
of Bacillus licheniformis , 

Another preferred embodiment of the invention relates to the DNA 
construct of the second aspect, wherein the chromosomal genets) of the host 

20 cell that is altered in i) encodes a D-alanine racemase and is at least 75% 
identical, preferably 80% identical, or preferably 85% identical, more 
preferably 90% identical, or more preferably 95% and most preferably at 
least 97% identical to the dal sequence of Bacillus licheniformis shown in 
positions 1303 to 2469 in SEQ ID 12. 

25 Yet another preferred embodiment of the invention relates to the DNA 

construct of the second aspect, wherein the chromosomal gene{s) of the host 
cell that is altered in i) is one or more genes of a xylose isomerase 
operon, preferably the gene(s) is homologous to the xylR gene or the xylA 
gene from Bacillus subtilis, more preferably the gene(s) is both a xylR and 

30 a xylA homologue, and most preferably the gene(s) is homologous to one or 
more genes of the xylose isomerase operon of Bacillus licheniformis. 

Still another preferred embodiment of the invention relates to the 
DNA construct of the second aspect, wherein the chromosomal gene (s) of the 
host cell that is altered in i) encodes a galactokinase (EC 2.7.1.6), an 

35 UTP-dependent pyrophosphorylase (EC 2.7.7.10), an UDP-glucose- dependent 
uridylyltransf erase (EC 2.7.7.12), or an UDP-galactose epimerase (EC 
5.1.2.3), preferably the gene(s) encodes an UDP-galactose epimerase (EC 
5.1.2.3), more preferably the gene(s) is homologous to the galE gene of 
Bacillus subtilis, and most preferably the gene(s) is the galE gene of 

40 Bacillus licheniformis. 
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One more preferred embodiment of the invention relates to the DNA 
construct of the second aspect, wherein the chromosomal gene(s) of the host 
cell that is altered in i) is one or more genes of a gluconate operon, 
preferably the gene(s) encodes a gluconate kinase (EC 2.7.1.12) or a 
5 gluconate permease or both, more preferably the gene(s) is homologous to one 
or more genes from Bacillus subtilis chosen from the group consisting of 
gntR, gntK, gntP, and gntZ, and most preferably the gene(s) is one or more 
genes of gntR, gntK, gntP, and gntZ from Bacillus licheniformis. 

The present invention provides a method for constructing a production 

10 host cell that is very useful to the biotech industry, such as a host cell 
of the third aspect comprising at least two copies of a gene of interest 
stably integrated into the chromosome, where each copy is integrated 
adjecent to different conditionally essential loci, wherein the cell is 
obtainable by any of the methods defined in the first aspect. 

15 The method of the first aspect describes the integration of a gene of 

interest into the chromosome of a host cell, so that the gene of interest is 
integrated in a position that is adjecent to the conditionally essential 
locus. The exact relative positions of the gene of interest and the locus 
are not of major relevance for the method, however generally speaking it is 

20 of interest to minimize the distance in basepairs separating the two, both 
to achieve a more stable integration, but also to minimize the integration 
of superfluous DNA sequence into the host cell genome. 

Accordingly a preferred embodiment of the invention relates to the 
host cell of the third aspect, wherein the gene of interest is separated 

25 from the conditionally essential locus by no more than 1000 basepairs, 
preferably no more than 750 basepairs, more preferably no more than 500 
basepairs, even more preferably no more than 250 basepairs, and most 
preferably no more than 100 basepairs. 

As mentioned above, it is of interest to minimize the presence of 

30 integrated or superfluous DNA sequence in the host cell genome, especially 
DNA of exogenous origin, and the ideal host cell contains only DNA of 
endogenous origin such as multiple copies of an endogenous gene of interest 
integrated in different well defined chromosomal locations. 

Consequently a preferred embodiment of the invention relates to the 

35 host cell of the third aspect, which contains only endogenous DNA. 

Certain bacterial strains are preferred as host cells in the biotech 
industry as mentioned previously. 

A preferred embodiment of the invention relates to the host cell of 
the third aspect, which is a Gram-positive bacterial cell, preferably a 

40 Bacillus cell, and most preferably a Bacillus cell chosen from the group 
consisting of Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus 
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brevis, Bacillus circulans, Bacillus clausii t Bacillus coagulans t Bacillus 
lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, 
Bacillus stearothermophilus , Bacillus subtilis, and Bacillus thuringiensis . 
Another preferred embodiment of the invention relates to the host 
5 cell of the third aspect, wherein a copy of the gene of interest is 

integrated adjecent to a gene encoding a D-alanine racemase, preferably a 
gene homologous to the dal gene from Bacillus subtilis, more preferably a 
gene at least 75% identical to the dal sequence of Bacillus licheniformis 
shown in positions 1303 to 2469 in SEQ ID 12, even more preferably 80% 

10 identical, or even more preferably a gene at least 85% identical, still more 
preferably 90% identical, more preferably at least 95% identical, and most 
preferably at least 97% identical to the dal sequence of Bacillus 
licheniformis shown in positions 1303 to 2469 in SEQ ID 12. 

Yet another preferred embodiment of the invention relates to the host 

15 cell of the third aspect, wherein a copy of the gene of interest is 
integrated adjecent to a gene of a xylose isomerase operon, preferably 
adjecent to genes homologous to the xylR or xylA genes from Bacillus 
subtilis, and most preferably adjecent to xylR or xylA from Bacillus 
licheni formi s . 

20 One more preferred embodiment of the invention relates to the host 

cell of the third aspect, wherein a copy of the gene of interest is 
integrated adjecent to a gene encoding a galactokinase (EC 2.7.1.6), an UTP- 
dependent pyrophosphorylase (EC 2.7.7.10), an UDP-glucose- dependent 
uridylyl transferase (EC 2.7.7.12), or an UDP-galactose epimerase (EC 

25 5.1.2.3), preferably adjecent to a gene encoding an UDP-galactose epimerase 
(EC 5.1.2.3), more preferably adjecent to a gene homologous to the galE gene 
from Bacillus subtilis t and most preferably adjecent to galE from Bacillus 
1 i cheni formi s . 

An additional preferred embodiment of the invention relates to the 
30 host cell of the third aspect, wherein a copy of the gene of interest is 

integrated adjecent to a gene of a gluconate operon, preferably adjecent to 
a gene that encodes a gluconate kinase (EC 2.7.1.12) or a gluconate 
permease, more preferably adjecent to a gene homologous to a Bacillus 
subtilis gene chosen from the group consisting of gntR, gntK, gntP, and 
35 gntz, and most preferably adjecent to gntR, gntK, gntP, or gntz from 
Bacillus licheniformis. 

The host cell of the third aspect is especially interesting for the 
industrial production of polypeptides such as enzymes. 

A preferred embodiment of the invention relates to the host cell of 
40 the third aspect, wherein the gene of interest encodes an enzyme, preferably 
an amylolytic enzyme, a lipolytic enzyme, a proteolytic enzyme, a cellulytic 
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enzyme, an oxidoreductase or a plant cell -wall degrading enzyme, and more 
preferably an enzyme selected from the group consisting of aminopeptidase , 
amylase, amyloglucosidase, carbohydrase, carboxypeptidase, catalase, 
cellulase, chitinase, cutinase, cyclodextrin glycosyltransf erase, 
5 deoxyribonuc lease, esterase, galactosidase, beta-galactosidase, 

glucoamylase, glucose oxidase, glucosidase, haloperoxidase, hemicellulase, 
invertase, isomerase, laccase, ligase, lipase, lyase, mannosidase, oxidase, 
pectinase, peroxidase, phytase, phenoloxidase, polyphenoloxidase, protease, 
ribonuclease, transferase, transglutaminase, or xylanase. 
10 A further preferred embodiment of the invention relates to the host 

cell of the third aspect, wherein no antibiotic markers are present. 

The present invention teaches the construction of host cells that are 
suitable for use in the method of the first aspect, especially host cells 
wherein one, two or more conditionally essential genes are rendered non- 
15 functional. In non-limiting examples below is shown how the preferred 

conditionally essential genes of the invention are rendered non- functional 
through a partial deletion by using specific Deletion Plasmids of the 
invention. Specifically the present invention relates to a Bacillus 
licheniformis cell of the fourth aspect, wherein at least two conditionally 
20 essential genes are rendered non- functional , preferably the genes are chosen 
from the group consisting of xylR, xylA, galE, gntR, gntK, gntP, gntZ, and 
dal. 

The use of such a host cell of the fourth aspect is likewise 
envisioned in the method of the first aspect. 
25 Another genetic tool provided by the present invention for the method 

of the first aspect, is a host cell comprising a DNA construct of the second 
aspect . 

A final aspect of the invention relatest to a process for producing 
an enzyme of interest, comprising cultivating a cell of the third aspect 
30 under conditions appropriate for producing the enzyme, and optionally 
purifying the enzyme. 

Examples 

35 

Example 1 

Bacillus licheniformis SJ4671 {WO 99/41358) comprises two stably 
integrated amyL gene copies in its chromosome, inserted in opposite relative 
orientations in the region of the B. licheniformis alpha-amylase gene, amyL. 
40 The following example describes the insertion into this strain of a third 
amyL gene copy by selectable , directed integration into another defined 
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region of the B. licheniformis chromosome resulting in a strain comprising 
three stable chromosomal copies of the amyL gene but which is devoid of 
foreign DNA. 

5 Xylose isomerase deletion/integration outline (Figure 1) 

The sequence of the Bacillus lichenformis xylose isomerase region is 
available in GenBank/EMBL with accession number Z80222 . 

A plasmid denoted "Deletion plasmid" was constructed by cloning two 
PCR amplified fragments from the xylose isomerase region on a temperature- 

10 sensitive parent plasmid. The PCR fragments were denoted "A" and tt B" , 
wherein A comprises the xylR promoter and part of the xylR gene; and B 
comprises an internal fragment of xylA missing the promoter and the first 70 
basepairs of the gene. A spectinomycin resistance gene flanked by resolvase 
(res) sites was introduced between fragments A and B on the plasmid. This 

15 spectinomycin resistance gene could later be removed by resolvase-mediated 
site-specific recombination. 

The xylose isomerase deletion was transferred from the Deletion 
plasmid to the chromosome of a Bacillus target strain by double homologous 
recombination via fragments A and B, mediated by integration and excision of 

20 the temperature- sensitive plasmid. The resulting strain was denoted 
"Deletion strain" . This strain is unable to grow on minimal media with 
xylose as sole carbon source. 

An * Integration plasmid" was constructed for insertion of genes into 
the xylose isomerase region of the Deletion strain. We intended to PCR- 

25 amplify a fragment denoted tt C" comprising the xylA promoter and about 1 kb 
of the xylA gene. However, as later described, only a smaller fragment 
denoted W D" comprising the xylA promoter and the first 250 basepairs of the 
xylA gene was succesfully amplified and cloned. The Integration plasmid 
comprises fragments A and D on a temperature -sensitive vector. An expression 

30 cassette was also cloned in the Integration plasmid between fragments A and 
D. 

The temperature -sensitive Integration plasmid was transferred to the 
B. licheniformis Deletion strain and it integrated in the chromosome; 
subsequent excision of the temperature sensitive vector was ensured, and 
35 "Integration strains" could then be isolated which grow on minimal media 
with xylose as sole carbon source. Such Integration strains have restored 
the chromosomal xylA gene, by double homologous recombination via fragments 
A and D. In this process, the expression cassette has been integrated into 
the chromosome. 

40 

Plasmid constructs 
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PCR amplifications were performed with Ready-To-Go PCR Beads from 
amersham pharmacia biotech as described in the manufacturers instructions, 
using an annealing temperature of 55 0 C. 

5 Plasmids pSJ5128 and pSJ5129: 

The A fragment {xylR promoter and part of the xylR gene) was 
amplified from Bacillus licheniformis PL1980 chromosomal DNA using primers: 

#183235; [Hindlll <-Z80222 1242-1261-*] (SEQ 10 1): 
10 5 ' - GACTAAGCTTCTGCATAGTGAGAGAAGACG 

#183234: [EcoRI; Bgrlll; Wot I ; Mlul ; Sail; Seal ^Z80222 2137-2113->] (SEQ ID 
2) : 

5 ' -GACTGAATTCAGATCTGCGGCCGCACGCGTGTCGACAGTACTGAAATAGAGGAAAAAATAAGTTTTC 

15 

The PCR fragment was digested with EcoRI and Hindlll and purified, 
then ligated to EcoRI and Hindlll digested pUC19* The ligation mixture was 
transformed by electroporation into E. coli SJ2 , and transf ormants were 
selected for ampicillin resistance (200 /ig/ml) . The PCR-f ragments of three 
20 such ampicillin resistant transformants were sequenced and all were found to 
be correct. Two clones designated SJ5128 (SJ2/pSJ5128) and SJ5129 
(SJ2/pSJ5129) were kept. 

Plasmids pSJ5124, pSJ5125: 
25 The B fragment (an internal part of xylA, missing the promoter and the first 
70 basepairs of the coding region) , was amplified from B. licheniformis 
PL1980 chromosomal DNA using primers: 

#183230 [EcoRI «-Z80222 3328-3306-*] (SEQ ID 3) : 
30 5 ' - GACTGAATTCCGTATCCATTCCTGCGATATGAG 

#183227 [Ba/riHI; Bglll ^Z80222 2318*2342-*] (SEQ ID 4) : 
5' -GACTGGATCCAGATCTTATTACAACCCTGATGAATTTGTCG 

35 The PCR fragment was digested with £coRl and BajriHl, and purified, 

then ligated to EcoRI + BarnHI digested pUC19 and transformed by 
electroporation into E. coli S J2 . Transformants were selected for ampicillin 
resistance (200 /xg/ml) . Two clones were correct as confirmed by DNA 
sequencing, and were kept as SJ5124 (SJ2/pSJ5124) and SJ5125 (SJ2/pSJ5125) . 

40 

Plasmid pSJ5130: 
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The C fragment (comprising the xylA promoter and about 1 kb of the 
xylA gene) was PCR amplified from B. licheniformis PL1980 chromosomal DNA 
using primers: 

5 #18323 0 (see above, SEQ ID 3) 

#183229 [BamHI; Bglll; Nftel; Clal ; SacII <-Z80222 2131-2156-*] (SEQ ID 5): 
5 ' -GACTGGATCCAGATCTGCTAGCATCGATCCGCGGCTATTTCCATTGAAAGCGATTAATTG 

10 The PCR fragment was digested with KcoRI and BazriHI and purified, then 

ligated to EcoRI and BamHI digested pUC19 and transformed by 
electroporation, into E, coli S J2 . Transf ormants were selected for 
ampicillin resistance (200 pig/ml) . One clone, comprising the full-length PCR 
fragment, was found to have a single basepair deletion in the promoter 

15 region, between the -35 and -10 sequences. This transf ormant was kept as 
SJ5130 (SJ2/pSJ5130) . 

Plasmid pSJS131: 

This plasmid was constructed as pSJ5130, above, but turned out to 
20 contain a 400 basepair PCR fragment only (the D fragment) , comprising the 
xylA promoter and the first 250 basepairs of the xylA coding sequence. DNA 
sequencing confirmed that the no sequence errors were present in the 
fragment. The transformant was kept as SJ5131 (SJ2/pSJ5131) . 

25 Plasmids pSJ5197, pSJ5198: 

These plasmids comprise the A [xylR) fragment on a temperature- 
sensitive, mobilizable vector. They were constructed by ligating the 0 . 9 kb 
Bgl I I -Hindi II fragment from pSJ5129 to the 5.4 kb Bglll -Hindlll fragment 
from pSJ2739, and transforming B. subtilis DN1885 competent cells with the 

30 ligation mix followed by selecting for erythromycin resistance (5 //g/ml) . 
Two clones were kept, SJ5197 (DN1885/pSJ5197 ) and SJ5198 (DN1885/pSJ5198 ) . 

Plasmids pSJ5211, pSJ5212: 

These plasmids contain a res-spc~res cassette inserted next to the B 

35 fragment. They were constructed by ligating the 1.5 kb BclI-Ba/riHI fragment 
from pSJ3358 into the Bglll site of pSJ5124, and transforming the ligation 
mix into E. coll SJ2 and selecting for ampicillin resistance (200 /ig/ml) and 
spectinomycin resistance (120 /ig/ml) resistance. Two clones were kept, 
wherein the res-spc-res cassette was inserted in either of the possible 

40 orientations, SJ5211 (SJ2/pSJ5211) and SJ5212 (SJ2/pSJ5212 > . 
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The Deletion plasmid 

Plasmid pSJ5218: 

This plasmid contains the res-spc-res cassette flanked by the A and B 
fragments. It was constructed by ligating the 2.5 kb EcoRI-BamHl fragment 
5 from pSJ5211 to the 5.3 kb JScoRI-Bglll fragment from pSJ5197, and 

transforming the ligation mix into B. subtilis DN1885 and selecting for 
erythromycin (5 //g/ml) and spectinomycin resistance (120 /xg/ml) resistance 
at 30°C. One transf ormant , SJ5218 (DN1885/pSJ5218) was kept. 

10 The Integration plasmids 

Plasmids pSJ5247, pSJ5248: 

These plasmids comprise the short 4 00 basepairs D fragment (PxylA- 
xylA) as well as the A fragment {xylR) on a temperature- sensitive, 
mobilizable vector. They were made by ligating the 0.4 kb BgIII-x7coRI 
15 fragment from pSJ5131 to the 5.3 kb Bglll-EcoRI fragment from pSJ5197, and 
transforming the ligation mix into B. subtilis DN1885 and selecting for 
erythromycin resistance (5 /xg/ml) at 3 0°C. Two transf ormants, SJ5247 
(DN1885/pSJ5247) and SJ5248 (DN1885/pSJ5248) were kept. 

20 Construction of strains with chromosomal xylA deletions. 

The deletion' plasmid pSJ5218 was transformed into competent cells of 
the B. subtilis conjugation donor strain PP289-5 (which contains a 
chromosomal dal -deletion, and plasmids pBCIS and pLS2 0) , transf ormants were 
selected for resistance to spectinomycin (120 //g/ml) , erythromycin (5 //g/ml) 
25 and tetracycline (5 /xg/ml) on plates with D-alanine (100 /xg/ml) at 30°C. Two 
transf ormants were kept, SJ5219 and SJ5220. 

The two-copy B. lichenifoirmis alpha-amylase strain SJ4671, described 
in WO 99/41358 was used as recipient in conjugations. 

Donor strains SJ5219 and SJ5220 were grown overnight at 30°C on LBPSG 
30 plates (LB plates with phosphate (0.01 M K 3 PO«) , glucose (0.4 %) , and starch 
(0.5 %) ) supplemented with D-alanine (100 /ig/ml), spectinomycin (120 /xg/ml), 
erythromycin (5 //g/ml) and tetracycline (5 //g/ml) . The recipient strain was 
grown overnight on LBPSG plates. 

An inoculation needle loopful of donor and recipient were mixed on 
35 the surface of a LBPSG plate with D-alanine (100 /xg/ml) , and incubated at 
30°C for 5 hours. This plate was then replicated onto LBPSG with 
erythromycin (5 //g/ml) and spectinomycin (120 /xg/ml) , and incubation was at 
30°C for 2 days. These four conjugations resulted in between 13 and 25 
trans con jugants . 

40 Tetracycline-sensitive (indicating absence of pBC16) transcon jugants 

were re isolated on LBPSG with erythromycin (5 /ig/ml) and spectinomycin (120 
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/ig/ml) at 50°C, incubated overnight, and single colonies from the 50°C plates 
were inoculated into 10 ml TY liquid cultures and incubated with shaking at 
26°C for 3 days. Aliquots were then transferred into fresh 10 ml TY cultures 
and incubation proceeded overnight at 3 0°C. The cultures were plated on 
5 LBPSG with 120 /zg/ml spectinomycin, after overnight incubation at 30°C these 
plates were replica plated onto spectinomycin and erythromycin, 
respectively, and erythromycin sensitive, spectinomycin resistant isolates 
were obtained from all strain conjugations. 

The following strains, containing the chromosomal xylA promoter and 
10 the first 70 basepairs of the xylA coding sequence replaced by the res-spc- 
res cassette, were kept: 



SJ5231: SJ4671 recipient, SJ5219 donor. 
SJ5232: SJ4671 recipient, SJ5220 donor. 

15 

Strain phenotypes were assayed on TSS minimal medium agar plates, 
prepared as follows. 4 00 ml H 2 0 and 10 g agar is autoclaved at 121°C for 20 
minutes, and allowed to cool to 60°C. The following sterile solutions are 
added : 

20 

1 M Tris pH 7.5 

2 % FeCl 3 .6H 2 0 

2 % trisodium citrate dihydrate 
l M K 2 HP0 4 
25 10 % MgSC 4 .7H 2 0 
10 % glutamine 
20 % glucose 
15 % xylose 



25 ml 
1 ml 
1 ml 
1.25 ml 
1 ml 

10 ml ; and 
12.5 ml; or 
16.7 ml 



30 Bacillus licheniformis SJ4671 grows well on both glucose and xylose 

TSS plates, forming brownish coloured colonies. 

The xylA deletion strains SJ5231-SJ5232 grow well on glucose TSS 
plates, but only a very thin, transparent growth is formed on the TSS xylose 
plates, even after prolonged incubation. These strains are clearly unable to 

35 use xylose as the sole carbon source. 



Directed and selectable integration into the xyl region. 

Integration plasmid pSJ5247 (containing the D and A fragments) , and 
40 as a negative control pSJ5198 (containing only the A fragment) were 

transformed into competent cells of the B . subtilis conjugation donor strain 
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PP289-5 (which contains a chromosomal dal-delet ion, and plasmids pBCl6 and 
pLS2 0) , transf ormants were selected for resistance to erythromycin (5 /ig/ml) 
and tetracycline (5 /zg/ml) on plates with D-alanine (100 /ig/ml) at 30°C. 
Transf ormants kept were: 

5 

SJ5255: PP289 - 5/pSJ5198 . 
SJ5257: PP2 8 9 - 5/pSJ524 8 . 

Donor strains SJ5255 and SJ5257 were used in conjugations to 
10 recipient SJ5231. Selection of transconjugants was on erythromycin (5 

pig/ml) , at 30°C. Transconjugants were streaked on TSS plates with xylose, at 
50°C. In parallel, SJ5221 was streaked as a xylose isomerase positive 
control strain (also at 50°C) . 

After overnight incubation, all strains had formed a very thin, 
15 transparent growth. The control, however, was better growing and colonies 
were brownish. 

After another day of incubation at 50°C, some brownish colonies were 
coming up on the background of thin, transparent growth, in transconjugants 
derived from SJ5257, i.e. the strain containing the Integration plasmid with 

20 the PxylA-xylA fragment (D) . These colonies were steadily growing, and 
further colonies were coming up, during subsequent days of continued 
incubation at 50°C. 

No brownish colonies (and no further growth than the thin, 
transparent growth seen after the first overnight incubation) were observed 

25 from transconjugants derived from SJ5255 (the negative control, unable to 
restore the chromosomal xylA gene) . 

Directed integration of an alpha -amylase gene into the xyl region. 

Construction of an amyL containing integration plasmid 
30 Plasmids pSJ5291 and pSJ5292 were constructed from the integration 

vector plasmid pSJ5247 by digestion of this plasmid with Bglll, and 
insertion of the 1.9 kb amyL containing Bglll-Bcll fragment from pSJ4457 
(described in WO 99/41358) . The ligation mixture was transformed into B. 
subtilis DN1885 and two transf ormants were kept as SJ5291 and SJ5292. 

35 

Construction of conjugative donor strains, transfer to B. licheniformis 
hosts, and chromosomal integration 

Plasmids pSJ5291 and pSJ5292 were transformed into competent cells of 
the B. subtilis conjugation donor strain PP289-5 (which contains a 
40 chromosomal dai-deletion, and plasmids pBC16 and pLS20) , transf ormants were 
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selected for resistance to erythromycin (5 jig/ml) and tetracycline (5 jzg/ml) 
on plates with D-alanine (100 /ig/ml) at 30°C. 

Transformants kept were SJ5293 (PP289-5/pSJ5291) and SJ5294 (PP289- 
5/pSJ5292) . These two strains were used as donors in conjugations to xylose 
5 isomerase deletion strains SJ5231 and SJ5232 . Transconjugant s were selected 
on LB PGA plates with erythromycin (5 /xg/ml) , and one or two tetracyclin- 
sensitive transconjugants from each conjugation were streaked on a TSS- 
xylose plate which was incubated at 50°C. After two days incubation, well- 
growing colonies were inoculated into liquid TY medium (10 ml) without 

10 antibiotics, and these cultures were incubated with shaking at 30°C. After 
overnight incubation, 100 pi from each culture were transferred into new 10 
ml TY cultures, and incubation repeated. This procedure was repeated another 
two times, and in addition the cultures were plated on TSS-xylose plates at 
30°C. After about a week, all plates were replicaplated onto TSS-xylose as 

15 well as LBPSG with erythromycin (5 /ig/ml) . The following day, putative Em- 
sensitive strains were restreaked on the same plate types. 
The following Em sensitive strains, which all grow well on TSS-xylose 
plates, were kept: 

20 SJ5308 (from conjugation donor SJ5293, host SJ5231) 

SJ5309 (from conjugation donor SJ5293, host SJ5231) 

SJ5310 (from conjugation donor SJ5293, host SJ5232) 

SJ5315 (from conjugation donor SJ5294, host SJ5231) 

25 Southern analysis. 

The two-copy amyL strain SJ4671, and strains SJ5308, SJ5309, SJ5310 
and SJ5315, were grown overnight in TY-glucose, and chromosomal DNA was 
extracted. The chromosomal DNA was digested with Hindlll, fragments 
separated by agarose gel electrophoresis, transferred to Immobilon-N* 

30 filters (Millipore*) and hybridised to a biotinylated probe based on Hindlll 
digested pSJ5292 (using NEBlot Photope Kit and Photope Detection Kit 6K) . 

In the two -copy strain, the two amyL gene copies reside on a -10 kb 
f/indlll fragment. In addition, an -2.8 kb fragment is hybridizing, which is 
due to hybridization to the xyl region. In the four strains with insertions 

35 of a third amyL gene into the xylose gene region, the -2.8 kb fragment is 
missing and has been replaced by a fragment of -4.6 kb. This is entirely as 
expected upon integration of the amyL gene into the xylose gene region. The 
-10 kb fragment due to the two-copy insertion is retained. 
In conclusion, the southern analysis shows that strains SJ5308. SJ5309, 

40 SJ5310 and SJ5315 have a correctly inserted third amyL gene copy in their 
chromosome . 
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Shake flask evaluation. 

Strains with the amyL gene integrated in the xylose isomerase region, 
as well as several control strains, were inoculated into 100 ml BPX medium 
5 in shake flasks and incubated at 37°C with shaking at 300 rpm for 7 days. 
Alpha-amylase activity in the culture broth was determinated by the Phadebas 
assay: 



Relative alpha-amylase 



10 


Strain 




Units /ml 




SJ4270 


(one copy amyL strain) 


100 




SJ4671 


(two copy amyL strain) 


161 




SJ5231 


(two copy amyL strain with xylA gene 


deletion) 148 




SJ5308 


(three -copy amyL strain) 


200 


15 


SJ5309 


(three -copy amyL strain) 


245 




SJ5310 


(three -copy amyL strain) 


200 




SJ5315 


(three-copy amyL strain) 


200 



Aliquot s from each shake flask were plated on amylase indicator 
20 plates. All colonies were amylase positive. Four single colonies from each 
of SJ4671, SJ5309 and SJ5315 were inoculated into fresh BPX shake flasks, 
which were cultured as above. Alpha-amylase activity in the culture broth 
was determinated by the Phadebas assay: 



25 



Strain 



Relative alpha-amylase 
Units/ml 



SJ4671 (two copy amyL 1 strain) 
SJ4671 
SJ4671 
30 SJ4671 

SJ5309 (three -copy amyL strain) 

SJ5309 

SJ5309 

SJ5309 

35 SJ5315 (three-copy amyL strain) 
SJ5315 
SJ5315 

SJ5315 



100 
102 
88 
84 
149 
141 
135 
149 
135 
147 
159 
153 
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Under these shake flask conditions, the three copy amyL strains 
(bold) seem to produce about 50% more alpha- amylase than the two-copy 
strain. 

5 

Example 2 

A strain of Bacillus licheniformis having two stably integrated amyL 
gene copies in its chromosome, inserted in opposite relative orientations in 
the region of the B. licheniformis alpha- amylase gene, amyL, has been 
10 described in WO 99/41358, as SJ4671. A third copy of the amyL gene was 
inserted in xylRA as described above 

This describes the insertion into this three-copy strain of a fourth 
amyL gene copy by selectable, directed integration into another region of 
the B. licheniformis chromosome. 

15 

Gluconat deletion/ integration outline (Figure 2) 

The sequence region of the Bacillus lichenformis gluconate operon 
comprising the gntR; gntK, gntP, gntz genes for utilization of gluconate is 
available in Genbank/EMBL with accession number D31631. The region can be 

20 schematically drawn as shown in figure 2. 

A deletion was introduced by cloning, on a temperature -sensitive 
plasmid, the PCR amplified fragments denoted as W A" (containing part of the 
gntK and part of the gntP gene) and M B" (containing an internal fragment of 
gntz) . As a help in the selection of deletion strains, a kanamycine 

25 resistance gene flanked by resolvase sites was introduced between fragments 
"A" and n B" , resulting in the plasmid denoted "Deletion plasmid" in figure 
2. This kanamycine resistance gene could later be removed by resolvase- 
mediated site-specific recombination, as described in WO 96/23073. 

The deletion was transferred to the chromosome of target strains by 

30 double homologous recombination via fragments U A" and n B", mediated by 
integration and excision of the temperature-sensitive plasmid. The result 
was the strain, labelled "Deletion strain" in figure 2. This strain is 
unable to grow on minimal media with gluconate as sole carbon source. 

35 Plasmid constructs 

To construct an Integration plasmid to be used for gene insertions, 
the PCR fragment W C" was amplified. This fragment contained an internal 
fragment of gntP of about 1 Kb. The Integration plasmid consists of 
fragments W B" and "C" on a temperature- sensitive vector. The expression 
40 cassette destined for integration is cloned between W B" and W C" . Upon 
transfer to the B. licheniformis Deletion strain and integration and 
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excision of the temperature- sensitive vector, strains could be isolated 
which grew on minimal media with gluconate as sole carbon source. Such 
strains had restored the chromosomal gntP gene by double homologous 
recombination via fragments W B" and "C . In this process, the expression 
5 cassette was integrated into the chromosome resulting in the "Integration 
strain" of figure 2 . 

PCR amplifications were performed with Ready-To-Go PCR Beads from 
amersham pharmacia biotech as described in the manufacturers instructions, 
using an annealing temperature of 5 5°C. 

10 

The Deletion Plasmids pMOL1789 and pMOL1790: 

The "B" fragment (containing the internal part of the gntZ) was 
amplified from chromosomal DNA from Bacillus licheniformis using primers 

15 #187338 [Aval <~D31631 4903-4922->] (SEQ ID 6) : 
5' -TATTTCCCGAGATTCTGTTATCGACTCGCTC 

#187339 [Eagl <-D31631 5553-5538->] (SEQ ID 7) : 
5 ' -GTTTTCGGCCGCTGTCCGTTCGTCTTT 

20 

The fragment was digested with Aval + KagrI, ligated to Aval + EagI 
digested pM0L1642, and the ligation mixture transformed, by transformation, 
into B . subtilis PL1801 selecting for erythromycin resistance (5 //g/ml) . 
The insert on three clones was sequenced, and all found to be correct. 
25 MOL1789 (JA578 (repF*) /pM0L178 9) and MOL1790 { JA578/pMOL1790) were kept. The 
endpoint of the "B" fragment relative to gntZ is shown in fig. 2. 

Plasmids pMOL1820 and pMOL1821: 

The "A" fragment (containing part of the gntK and part of the gntP 
30 gene) , was amplified from chromosomal DNA of Bacillus licheniformis using 
primers 

#184733 [^D31631 3738-3712->] (SEQ ID 8) : 
5' -GTGTGACGGATAAGGCCGCCGTCATTG 

35 

#184788 [4-D31631 3041-3068->] (SEQ ID 9) : 
5' -CTCTTGTCTCGGAGCCTGCATTTTGGGG 

The fragment was digested with Clal + EcoRI, ligated to EcoRI + Clal 
40 digested pMOL1789, and transformed, by transformation, into B . subtilis 

PL1801 selecting for erythromycin resistance (5 fig/ml) . The insert on three 
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clones was sequenced, and all found to be correct. MOL1820 ( JA578/pMOLl820) 
and MOL1821 ( JA57 8/pMOL1821) were kept. The endpoint of the W A" fragment 
relative to gntZ is shown in fig. 2. 

5 The Integration plasmids pMOL1912 and pMOL1913: 

These plasmids contain a short C-terminal part of gntK and the entire 
open reading frame of gntP (the W C" fragment) on a temperature-sensit ive, 
mobilizable vector. They were made by ligating a 0.9 kb fragment amplified 
from chromosomal DNA of Bacillus licheniformis using primers: 

10 

#B1656D07 [^D31631 3617-3642">] (SEQ ID 10) : 
S r -AGCATTATTCTTCGAAGTCGCATTGG 

#B1659F03 [BglII^D31631 4637-4602-*] (SEQ ID 11) : 
15 5 ' - TTAAGATCTTTTTTATACAAATAGGCTTAACAATAAAGTAAATCC 

The fragment was digested with Bglll + EcoRI t ligated to Bglll + 
EcdRI digested pMOL1820, and the ligation mixture transformed, by 
transformation, into B. subtilis PL1801 selecting for erythromycin 
20 resistance (5 //g/ml) . The insert on three clones was sequenced, and all 

found to be correct. MOL1912 (PL1801/pMOL1789) and MOL1913 (PL1801/pMOL1913) 
were kept. The endpoint of the W C" fragment relative to gntZ is shown in 
fig. 2. 

These plasmids were found to express functional GntP even if they do 
25 not have a promoter sequence directly upstream of the gntP gene. In order to 
enable directed integration in the gntP region by selecting for growth on 
gluconate it was necessary to delete part of the N- terminal sequence of the 
gntP gene on the integration plasmid pMOL1912 . 

30 Plasmids pMOL1972 and pMOL1973: 

These plasmids are Deletion derivatives of pMOL1912 which contain the 
entire gntP gene except for the first 158 bp coding for 53 amino acids of 
the N-terminal. The plasmid pMOL1912 was digested with StuI + EcoRV and re- 
ligated. The ligation mixture was transformed, by competence, into B. 

35 subtilis PL1801 selecting for erythromycin resistance (5 //g/ml) . The 

deletion was verified by restriction digest. MOL1972 (PL18 01/pMOIil972 ) and 
MOL1973 (PL180l/pMOL1973) were kept. 

These plasmids do not support growth on TSS gluconate plates when 
introduced as free plasmids in a gntP deleted background. 

40 

Construction of strains with chromosomal gntP d let ions 
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The Deletion plasmid pMOL1920 was transformed into competent cells of 
the B. subtilis conjugation donor strain PP289-5 (which contains a 
chromosomal dal-deletion, and plasmids pBC16 and pLS20) , selecting 
resistance to kanamycine (10 //g/ml) , erythromycin (5 fxg/ml) and tetracycline 
5 (5 /-tg/ml) on plates with D-alanine (100 /ig/ml) at 30°C. Two transf ormants 
were kept, MOL1822 and MOL1823 . 

The two-copy B. licheniformis alpha-amylase strain SJ4671, described in WO 
99/41358 was used as recipient in conjugations. 

Donor strains MOL1822 and MOL1823 .were grown overnight at 30°C on 
10 LBPSG plates (LB plates with phosphate (0.01 M K 3 P0 4 ) , glucose (0.4 %) , and 
starch (0.5 %) ) supplemented with D-alanine (100 /zg/ml) , kanamycine (10 
fig /ml) f erythromycin (5 /ig/ml) and tetracycline (5 /ig/ml) . The recipient 
strain was grown overnight on LBPSG plates. 

A loopful of donor and recipient were mixed on the surface of a LBPSG plate 

15 with D-alanine (100 /ig/ml) , and incubated at 30°C for 5 hours. This plate 
was then replicated onto LBPSG with erythromycin (5 //g/ml) and kanamycine 
(10 /xg/ml) , and incubation was at 30°C for 2 days. These four conjugations 
resulted in between 25 and 50 transconjugants . 

Tetracycline-sensitive (indicating absence of pBC16) transconjugants 

20 were reisolated on LBPSG with erythromycin (5 /ig/ml) and kanamycine (10 

jig/ml) at 50°C, incubated overnight, and single colonies from the 50°C plates 
were inoculated into 10 ml TY liquid cultures and incubated with shaking at 
26°C for 3 days, then aliquots were transferred into fresh 10 ml TY cultures 
and incubation continued overnight at 30°C. The cultures were then plated on 

25 LBPSG with 10 /ig/ml kanamycine, after overnight incubation at 3 0°C these 
plates were replica plated onto kanamycine and erythromycin, respectively, 
and erythromycin sensitive, kanamycine resistant isolates were obtained from 
all strain combinations. The following strains, where part of the gntP gene 
coding for the C-terminal was replaced by the res-kana-res cassette, were 

30 kept : 

MOL1871; SJ4671 recipient, MOL1822 donor. 
MOL1872: SJ4671 recipient, MOL1823 donor. 

35 Strain phenotypes were assayed on TSS minimal medium agar plates, prepared 
as follows: 

400 ml H 2 0 is added 10 g agar and is autoclaved at 121°C for 20 
minutes, and allowed to cool to 60°C. The following sterile solutions are 
added : 



1 M Tris pH 7.5 



25 ml 
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2 % FeCl 3 .6H 2 0 1 ml 

2 % trisodium citrate dihydrate 1 ml 

1 M K 2 HPO, 1.25 ml 

10 % MgS0 4 .7H 2 0 1 ml 

10 % glut amine 10 ml, and 

20 % glucose 12.5 ml, or 

15 % gluconate 16.7 ml 



Bacillus licheniformis SJ4671 grows well on both glucose and 
10 gluconate TSS plates, forming brownish coloured colonies. The gntP Deletion 
strains MOL1871 and MOL1872 grow well on glucose TSS plates, but only a very 
thin, transparent growth is formed on the TSS gluconate plates, even after 
prolonged incubation. These strains are clearly unable to use gluconate as 
the sole carbon source. 
15 The same gntP deletion procedure is performed on the three copy 

strain SJ53 09 described earlier to prepare for integration of a fourth copy 
of the amylase expression cassette. 



Directed and selectable integration into the gnt region 

20 Integration plasmid pMOL1972 (containing the n B" and W C" fragments) , 

and as a negative control pMOL1789 (containing only the n B" fragment) , were 
transformed into competent cells of the B . subtilis conjugation donor strain 
PP289-5 (which contains a chromosomal dal-deletion, and plasmids pBC16 and 
pLS20) , selecting resistance to erythromycin (5 /ig/ml) and tetracycline (5 

25 /zg/ml) on plates with D-alanine (100 //g/ml) at 30°C. Transf ormants kept 
were : 



MOL1974 : PP289 - 5/pM0L1972 . 
MOL1975: PP289-5/pMOL1973 . 

30 

Donor strains M0L1974 and MOL1975 were used in conjugations to 
recipient MOL1871 and MOL1872 . Selection of transconjugants was on 
erythromycin (5 ^g/ml) , at 30°C. Transconjugants were streaked on TSS plates 
with gluconate, at 50°C. In parallel, SJ4671 was streaked as a gluconate 

35 positive control strain (also at 50°C) . 

After overnight incubation, all strains had formed a very thin, 
transparent growth. The control, however, was better growing and colonies 
were brownish. After another day of incubation at 50°C, some brownish 
colonies were coming up on the background of thin, transparent growth, in 

40 transconjugants derived from MOL1871 and MOL1872. These colonies were 
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steadily growing, and further colonies appeared, during subsequent days of 
continued incubation at 50°C. 

No colonies were observed from the gntP deleted strains MOL1871 and 
MOL1872 . 

5 

Directed integration of an alpha -amylase gene into the gnt region 

Construction of an amyL containing Integration plasmid. 

The following is a construction plan for integrating an expression 
cassette with the alpha -amylase gene in the gnt region making use of the 

10 selection principle described above. The integration plasmid pMOL1972 is 
digested with Bsrlll, and a 1 . 9 kb Bglll-BcII fragment containing amyL from 
pSJ4457 (described in WO 99/41358) is inserted by ligation. The ligation 
mixture is then transformed into B. subtilis DN1885 and transf ormants 
selected on LBPSG plates with erythromycin (5 jzg/ml) are verified by 

15 restriction digestion of plasmid DNA. 

Conjugative donor strains, transfer to B. licheniformis, and chromosomal 
integration. 

The Integration plasmid with the expression cassette is transformed 
20 into competent cells of the B. subtilis conjugation donor strain PP2 89-5 
(which contains a chromosomal dal- deletion, and plasmids pBC16 and pLS20) , 
selecting resistance to erythromycin (5 /zg/ml) and tetracycline (5 pig/ml) on 
plates with D-alanine (100 ^g/ml) at 30°C. 

Transf ormants comprising the Integration plasmid with the expression 
25 cassette are preserved and used as donors in conjugations with a gntP 
Deletion recipient of the three-copy strain SJ5309, which in turn was 
constructed as described for the Deletion strains MOL1871 and MOL1872 
described above. 

Transconjugants are selected on LB PGA plates with erythromycin (5 
30 ng/ml) , and one or two tetracycl in- sensitive transconjugants from each 

conjugation is streaked on a TSS-gluconate plate which is incubated at 50°C. 
After two days incubation, well-growing colonies are inoculated into liquid 
TY medium (10 ml) without antibiotics, and these cultures are incubated with 
shaking at 30°C. After overnight incubation, 100 jzl from each culture is 
35 transferred into new 10 ml TY cultures, and incubated. This procedure is 
repeated twice, and in addition the cultures are plated on TSS-gluconate 
plates at 30°C, 

After about a week, all plates are replica-plated onto TSS-gluconate 
as well as LBPSG with erythromycin (5 /ig/ml) and incubated. The following 
40 day putative Em- sensitive strains are restreaked on the same plate types 
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As for integration in the xylose region described earlier, Southern 
analysis and shake flask evaluation is performed to verify the site of 
integration in the gnt region of the alpha -amylase expression cassette and 
the increased yield from this four copy strain. 

5 

Example 3 

Bacillus licheniformis SJ4671 (WO 99/41358) comprises two stably 
integrated amyL gene copies in its chromosome, inserted in opposite relative 
orientations in the region of the B. licheniformis alpha-amylase gene, amyL. 
10 The following example describes the insertion into this strain of a third 
amyL gene copy by selectable, directed integration into another region of 
the B . licheniformis chromosome. 

D-alanine racemase deletion/integration outline 

15 The DNA sequence of the Bacillus lichenformis D-alanine racemase 

region was determined in this work and is shown in positions 1303 to 2469 in 
SEQ ID 12. A plasmid denoted w Dal-Deletion plasmid" was constructed by 
cloning one 2281 bp PCR amplified fragment from the D-alanine racemase 
region of Bacillus lichenformis on a temperature -sensitive parent plasmid. 

20 The PCR 2281 bp fragment was denoted W A" , wherein A comprises the sequence 
from 24 5 basepairs upstream of the ATG start codon of the dal gene to 867 
basepairs downstream of the dal gene . 

A deletion of 586 basepairs of the C-terminal part of the dal gene on 
the cloned fragment A was done resulting in a plasmid containing the 

25 fragments W B" and "C" as shown below. A spectinomycin resistance gene 

flanked by resolvase (res) sites was introduced between fragments M B" and 
»C" on the plasmid. This spectinomycin resistance gene could later be 
removed by resolvase -mediated site- specific recombination. 

The D-alanine racemase deletion was transferred from the Dai-Deletion 

30 plasmid to the chromosome of a Bacillus target strain by double homologous 
recombination via fragments W B" and "C" , mediated by integration and 
excision of the temperature-sensitive Dai-Deletion plasmid. The resulting 
strain was denoted "Dai-Deletion strain". This strain was unable to grow on 
media without D-alanine. 

35 An Integration plasmid was constructed for insertion of genes into 

the D-alanine region of the Deletion strain. We intended to PCR-amplify a 
fragment denoted "D" comprising 1117 basepairs of the dal gene starting from 
41 basepairs downstream of the ATG start codon. The promoter region was 
substituted with the Tl and T2 terminators from the 3' -terminal sequence of 

40 the Escherichia coli rrnB ribosome RNA operon (EMBL/e09023 : basepair 197- 
295) . 
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The Integration plasmid comprises fragments D and C on a temperature- 
sensitive vector. An expression cassette destined for integration was cloned 
between the fragments D and C. Upon transfer to the B . licheniformis 
deletion strain, integration, and excision of the temperature -sensitive 
5 vector, strains could be isolated which grow on media without D-alanine. 

Such "Integration strains" have restored the chromosomal dal gene, by double 
homologous recombination via fragments D and C. In this process, the 
expression cassette was integrated into the chromosome. 

10 Plasmid constructs 

PGR amplifications were performed with Ready -To -Go PCR Beads from 
amersham pharmacia biotech as described in the manufacturers instructions, 
using an annealing temperature of 55°C. 

15 Plasmids pJA744: 

The A fragment (dal -region) was amplified from Bacillus licheniformis 
SJ4671 chromosomal DNA using primers: 

#148779; [Upstream of a Sphl site in the dal region] (SEQ ID 14) : 
20 5 ' -GATGAACTTCTGATGGTTGC 

#148780: [BamHI < dal] (SEQ ID 15): 
5' -AAAGGATCCCCCTGACTACATCTGGC 

25 The PCR fragment was digested with Sphl and BairiKl and purified, then 

ligated to Sphl and BamHI digested pt*L2438. Transforming B , subtil is JA691 
(repF*, dal ) competent cells with the ligation mix followed by selecting for 
kanamycin resistance (10 /xg/ml) . Correct clones could complement the JA691 
dal phenotype . 

30 

Plasmid pJA770: 

This plasmid contains a res-spc-res cassette inserted between the B 
and C fragments. It was constructed by ligating the 1.5 kb BclI-BajriHl 
fragment from pSJ3358 into the Bell - Bell sites of pJA744 . Transforming B. 
35 subtilis JA$9_i competent cells with the ligation mix followed by selecting 
for kanamycin resistance (10 /xg/ml) and spectinomycin resistance (120 
/ig/ml) . Orientation of the spectinomycin resistance gene was could be 
determined by cutting with Bell and BamHI , 



40 Dal Deletion plasmid 

Plasmid pJABSl: 



OOO-DK 



38 

A fragment (comprising the ermC gene and the replication origin of 
pE194) was PCR amplified from pSJ2739 plasmid DNA using primers: 

#170046 [WotI; < ermC gene and the replication origin of pE194>] (SEQ ID 16) 
5 5 ' - AAAGCGGCCGCGAGACTGTGACGGATGAATTGAAAAAGC 

#170047 [FcoRI; <- ermC gene and the replication origin of pE194-^] (SEQ ID 
17) : 

5' -AAAGAATTCGTGAAATCAGCTGGACTAAAAGG 

10 

The PCR fragment was digested with EcoRI and NotI and purified, then 
ligated to EcdRl and Ndtl digested pJA770. Transforming B. subtilis J&691 
competent cells with the ligation mix followed by selecting for erythromycin 
resistance (5 fig/ml) and spectinomycin resistance (120 /zg/ml) , 

15 

Plasmid PJA748: 

A fragment {comprising the dal gene without the promotor region) was 
PCR amplified from Bacillus lichenif ormis SJ4671 DNA using primers: 

20 #150506 [BamHI; < dal gene] (SEQ ID 18) 

5 ' - AAAGGATCCCGCAAGCAAAGTTGTTTTTCCGC 

#150507 [JCpnl; <- dal gene] (SEQ ID 19) : 
5 ' -AAAGGTACCGAAAGACATGGGCCGAAATCG 

25 

The PCR fragment was digested with KpnJ and BamHI and purified, then 
ligated to Kpril and BamHI digested pPL2430. Transforming B. subtilis JA691 
competent cells with the ligation mix followed by selecting for kanamycin 
resistance (10 jig/ml) . 

30 

Plasmids pJA762: 

A fragment {comprising the T x and T 2 Terminators from the E.coli rrnB 
terminal sequence EMBL [e09023] from basepair 197 to 295) was PCR amplified 
35 from Escherichia coli SJ2 DNA using primers: 

#158089 [Xpnl; < T x and T 2 Terminators of rrnB] (SEQ ID 20) 
5 ' -AAAGGTACCGGTAATGACTCTCTAGCTTGAGG 

40 #158090 [Clal; < Ti and T 2 Terminators of rrnB] (SEQ ID 21): 
5' -CAAATCGATCATCACCGAAACGCGGCAGGCAGC 
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The PCR fragment was digested with Kpnl and Clal and purified, then 
ligated to Kpnl and Clal digested pJA748. Transforming B. subtilis 
competent cells with the ligation mix followed by selecting for kanamycin 
5 resistance (10 /*g/ml) . 



Plasmids pJA767: 

10 A fragment (comprising the 0.7kbp DNA sequence downstream of dal 

(DFS) ) was PCR amplified from Escherichia coli SJ2 DNA using primers: 

#150508 [Hindlll; < DFS] (SEQ ID 22) 

5' -ATTAAGCTTGATATGATTATGAATGGAATGG 

15 

#150509 [Nhel; < DFS] (SEQ ID 23): 

5' -AAAGCTAGCATCCCCCTGACTACATCTGGC 

The PCR fragment was digested with Hindlll and Nhel and purified, 
20 then ligated to Kpnl and Clal digested pJA762. Transforming B. subtilis 
JA691 competent cells with the ligation mix followed by selecting for 
kanamycin resistance (10 /zg/ml) . 

Plasmid pJA776 

25 This plasmid contains the amyL cassette flanked by the D and C 

fragments. It was constructed by ligating the 2.8 kb Hindlll-Mhel fragment 
from pSJ4457 to the 4.2 kb BanH I -Hindlll fragment from pJA767, and 
transforming the ligation mix into B. subtilis JA691 competent cells 
followed by selecting for kanamycin resistance (10 pig/ml) . 

30 

Dal Integration plasmid 

Plasmid pJA1020: 

This plasmid contains the amyL cassette flanked by the D and C 
fragments. Further the plasmid contains the plasmid pE194 replication 
35 origin, repF and the Em r -gene . It was constructed by ligating the 2 . 7kb 
EcoRl-Nhel fragment of pJA776 to the 3 . 8kb EcoRI-Whel fragment of pJA851, 
and transforming the ligation mix into B. subtilis JA691 competent cells 
followed by selecting for erythromycin resistance (5 /ig/ml) . 

40 

Construction of chromos mal dal deletions 
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The Deletion plasmid pJA851 was transformed into competent cells of 
the B. stiJbtilis conjugation donor strain PP289-5 (which contains a 
chromosomal dal-deletion, and plasmids pBC16 and pLS20) , and transf ormants 
were selected for resistance to spectinomycin (120 /ig/ml) , erythromycin (5 
5 ptg/ml) , and tetracycline (5 ng/ml) on plates with D-alanine (100 £ig/ml) at 
30°C. Transf ormants were kept as JA954 and used as donor in the following 
conjugation experiments. 

The two-copy amyL B. licheniformis SJ4671 (WO 99/41358) was used as 
recipient in the following conjugation experiments. 

10 Donor strain JA954 were grown overnight at 3 0°C on LBPSG plates (LB 

plates with phosphate (0.01 M K 3 PO«) , glucose (0.4 %) , and starch (0.5 %) ) 
supplemented with D-alanine (100 /xg/ml) , spectinomycin (120 fig/ml) , 
erythromycin (5 j/g/ml) and tetracycline (5 /xg/ml) . The recipient strain 
SJ4671 was grown overnight on LBPSG plates. 

15 Approx. one loop of an inoculation needle of donor and recipient each 

were mixed on the surface of a LBPSG plate with D-alanine (100 /ig/ml) , and 
incubated at 30°C for 5 hours. This plate was then replicated onto LBPSG 
with erythromycin (5 /ig/ml) and spectinomycin (120 /ig/ml) , and was incubated 
at 30°C for 2 days. These four conjugations resulted in 13 - 25 

20 transconjugants . 

Tetracycline- sensitive (indicating absence of pBC16) transconjugants 
were reisolated on LBPSG plates with erythromycin (5 /ig/ml) and 
spectinomycin (120 pg/ml) at 50°C, and incubated overnight. Single colonies 
from the 50°C plates were inoculated into 10 ml TY liquid medium and 

25 incubated with shaking at 26°C for 3 days, whereafter aliquot s were 
transferred into fresh 10 ml TY cultures and incubation was continued 
overnight at 30°C. The cultures were plated on LBPSG with 120 /ig/ml 
spectinomycin, after overnight incubation at 30°C these plates were replica 
plated onto spectinomycin and erythromycin, respectively. 

30 Erythromycin sensitive, spectinomycin resistant isolates were 

obtained from all strain combinations. The following straincomprising the 
chromosomal dal promoter and the first 672 basepairs of the dal coding 
sequence replaced by the res-spc-res cassette, was kept: 

35 B. lichenifo2rmis JA967: SJ4671 recipient, JA954 donor. 

Strain pheno types were assayed on LBPG with 120/tg spectinomycin 
supplemented with or without D-alanine (100 /ig/ml) 

Bacillus licheniformis SJ4671 grows well on both plates with or 
40 without D-alanine. The xylA deletion strain JA967 growth well on LBPG D- 
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alanine plates, but not on LBPG plates without D-alanine. These strains are 
clearly unable to grow without adding D-alanine to the media. 

The sequence of the B. lichenif ormis dal -region (SEQ ID 12) : 

5 The dal -region (comprising the ydcC gene, a terminator, the dal gene 

and the sequence downstream of dal {DFS) ) was PCR amplified from Bacillus 
lichenif ormis ATCC14580 chromosomal DNA using the primers: 

#145507 [ < ydcC - dal - DFS >] (SEQ ID 24) : 
10 5 ' - GCGTACCGTTAAAGTCGAACAGCG 

#150509 [Nhel; < ydcC - dal - DFS >] (SEQ ID 25) : 
5 ' - AAAGCTAGCATCCCCCTGACTACATCTGGC 

15 Sequencing of the D-alanine encoding sequence of Bacillus 

licheniformis that is shown in positions 13 03-2469 of SEQ ID 12 and a 
subsequent homology search in the public databases revealed that the newly 
isolated dal gene has a sequence identity of only approx. 67% with the dal 
gene of Bacillus subtilis, no other D-alanine racemase encoding genes show a 

20 higher homolgoy to this new B. licheniformis dal gene. 
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Claims 

1. A method for constructing a cell comprising at least two copies of a gene 
of interest stably integrated into the chromosome in different positions, 
the method comprising the steps of: 

5 a) providing a host cell comprising at least one chromosomal copy of the 
gene of interest; 

b) altering a conditionally essential chromosomal gene(s) of the host cell 
whereby the gene becomes non-funtional ; 

c) making a DNA construct comprising: 

10 i) an altered non- functional copy of the chromosomal gene(s) of 

step b) ; and 

ii) at least one copy of the gene of interest flanked on one side by 
i) and on the other side by a DNA fragment homologous to a host cell 
DNA sequence adjacent to the gene(s) of step b) ; wherein a first 
15 recombination between the altered copy of i) and the altered 

chromomosomal gene(s) of step b) restores the chromosomal gene(s) to 
functionality and renders the cell selectable; 

d) introducing the DNA construct into the host cell and cultivating the 
cell under conditions that require a functional gene(s) of step b) ; and 

20 e) selecting a host cell that grows under the conditions of step d) ; 

wherein the at least one copy of the gene of interest has integrated 
into the host cell chromosome adjacent to the gene(s) of step b) ; and 
optionally 

f) repeating steps a) to e) at least once using a different chromosomal 
25 gene(s) in step b) in each repeat. 

2. The method of claim l, wherein subsequent to step d) and prior to step e) 
a second recombination takes place between the DNA fragment and the 
homologous host cell DNA sequence. 

30 

3. The method of claim 2, where the DNA construct further comprises a marker 
gene which is located in the construct such that it is recombined out of the 
chromosome by the second recombination. 

35 4. The method of claim 3, wherein the marker gene confers resistance to an 
antibiotic, preferably the antibiotic is chosen from the group consisting of 
chloramphenicol, kanamycin, ampillin, erythromycin, spectinomycin and 
tetracycline . 
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5. The method of claims 3 or 4 , wherein a host cell is selected which grows 
under the conditions of claim 1 step d) and which does not contain the 
marker gene in the chromosome. 

5 6. The method of any of claims 1-5, where the DNA construct further 
comprises a marker gene located between the altered copy and the DNA 
fragment, and wherein the marker gene is flanked by nucleotide sequences 
that are recognized by a specific resolvase, preferably the nucleotide 
sequences are res. 

10 

7. The method of claim 6, wherein the marker gene is excised from the 
chromosome by the action of a resolvase enzyme after step e) and prior to 
step f ) . 

15 8. The method of any of claims 1-7, wherein the gene of interest 
originates from the host cell . 

9. The method of any of claims 1-8, wherein the gene of interest encodes 
an enzyme, preferably an amylolytic enzyme, a lipolytic enzyme, a 

20 proteolytic enzyme, a cellulytic enzyme, an oxidoreductase or a plant cell- 
wall degrading enzyme, and more preferably an enzyme with an activity 
selected from the group consisting of aminopeptidase, amylase, 
amyloglucosidase, carbohydrase, carboxypeptidase, catalase, cellulase, 
chitinase, cutinase, cyclodextrin glycosyltransf erase , deoxyribonuc lease , 

25 esterase, galactosidase, beta-galactosidase, glucoamylase, glucose oxidase, 
glucosidase, haloperoxidase, hemicellulase , invertase, isomerase, laccase, 
ligase, lipase, lyase, mannosidase, oxidase, pectinase, peroxidase, phytase, 
phenoloxidase , polyphenoloxidase , protease , ribonuclease , transferase , 
transglutaminase, or xylanase. 

30 

10. The method of any of claims 1-9, wherein the host cell selected in 
step e) comprises DNA only of endogenous origin. 

11. The method of any of claims 1-10, wherein the chromosomal gene{s) of 
35 the host cell is altered by partially deleting the gene, or by introducing 

mutations in the gene(s). 

12. The method of any of claims 1 - 11, wherein the chromosomal gene(s) of 
the host cell that is altered in step b) encodes a D-alanine racemase, 

40 preferably the gene(s) is a dal homologue from a Bacillus cell, more 
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preferably the gene is homologous to dal from Bacillus subtilis, and most 
preferably the gene(s) is the dal gene of Bacillus licheniformis, 

13. The method of any of claims 1 - 11, wherein the chromosomal gene(s) of 
5 the host cell that is altered in step b) encodes a D-alanine racemase and is 
at least 75% identical, preferably 85% identical, more preferably 95% and 
most preferably at least 97% identical to the dal sequence of Bacillus 
licheniformis shown in positions 1303 to 2469 in SEQ ID 12. 

10 14. The method of any of claims 1-11, wherein the chromosomal gene(s) of 
the host cell that is altered in step b) is one or more genes of a xylose 
isomerase operon, preferably the gene(s) is homologous to the xylR gene or 
the xylA gene from Bacillus subtilis, more preferably the gene(s) is 
homologous to both xylR and xylA, and most preferably the gene(s) is 

15 homologous to one or more genes of the xylose isomerase operon of Bacillus 
licheni formi s . 

15. The method of any of claims 1 - 11, wherein the chromosomal gene(s) of 
the host cell that is altered in step b) encodes a galactokinase (EC 

20 2.7.1.6), an UTP-dependent pyrophosphorylase (EC 2.7.7.10), an UDP-glucose- 
dependent uridylyltransf erase (EC 2.7.7.12), or an UDP-galactose epimerase 
[EC 5.1.2.3), preferably the gene(s) encodes an UDP-galactose epimerase (EC 
5.1.2.3), more preferably the gene{s) is homologous to galE of a Bacillus, 
and most preferably the gene is galE of Bacillus licheniformis. 

25 

16. The method of any of claims 1 - 11, wherein the chromosomal gene(s) of 
the host cell that is altered in step b) is one or more genes of a gluconate 
operon, preferably the gene(s) encodes a gluconate kinase (EC 2.7.1.12) or a 
gluconate permease or both, more preferably the gene(s) is one or more genes 

30 homologous to genes from Bacillus subtilis chosen from the group consisting 
of gntR, gntK, gntP, and gntZ, and most preferably the gene(s) is one or 
more genes of gntJ?, gntK, gntP, and gntZ from Bacillus licheniformis . 

17. The method of any of claims 1 - 16, wherein the host cell is a Gram- 
35 positive bacterial cell, preferably a Bacillus cell, and most preferably a 

Bacillus cell chosen from the group consisting of Bacillus alkalophilus, 
Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus 
clausii, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus 
licheniformis, Bacillus megaterium, Bacillus stearothermophilus, Bacillus 
40 subtilis, and Bacillus thuringiensis . 
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18. The method of any of claims 1-17, wherein the DNA construct is a 
plasmid . 

19. A DNA construct comprising: 

5 i) an altered non- functional copy of a conditionally essential 

chromosomal gene(s) from a host cell, preferably the copy is partially 
deleted; and 

ii) at least one copy of a gene of interest flanked on one side by i) 
and on the other side by a DNA fragment homologous to a host cell DNA 
10 sequence located on the host cell chromosome adjacent to the 

conditionally essential gene(s) of i) . 

20. The DNA construct of claim 19, wherein the chromosomal gene(s) of the 
host cell that is altered in i) encodes a D-alanine racemase, preferably the 

15 gene(s) is a dal homologue from a Bacillus cell, more preferably the gene is 
homologous to dal from Bacillus subtilis, and most preferably the gene is 
the dal gene of Bacillus licheniformis. 

21. The DNA construct of claim 19, wherein the chromosomal gene(s) of the 

20 host cell that is altered in i) encodes a D-alanine racemase and is at least 
75% identical, preferably 85% identical, more preferably 95% and most 
preferably at least 97% identical to the dal sequence of Bacillus 
licheniformis shown in positions 1303 to 2469 in SEQ ID 12. 

25 22. The DNA construct of claim 19, wherein the chromosomal gene(s) of the 
host cell that is altered in i) is one or more genes of a xylose isomerase 
operon, preferably the gene(s) is homologous to the xylR gene or the xylA 
gene from Bacillus subtilis, more preferably the gene(s) is both a xylR and 
a xylA homologue, and most preferably the gene(s) is homologous to one or 

30 more genes of the xylose isomerase operon of Bacillus licheniformis . 

23. The DNA construct of claim 19, wherein the chromosomal gene(s) of the 
host cell that is altered in i) encodes a galactokinase (EC 2.7.1.6), an 
UTP-dependent pyrophosphorylase (EC 2.7.7.10), an UDP-glucose- dependent 
35 uridylyltransferase (EC 2.7.7.12), or an UDP-galactose epimerase (EC 

5.1.2.3), preferably the gene(s) encodes an UDP-galactose epimerase (EC 
5.1.2.3), more preferably the gene(s) is homologous to the galE gene of 
Bacillus subtilis, and most preferably the gene(s) is the galE gene of 
Bacillus licheniformis. 
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24. The DNA construct of claim 19, wherein the chromosomal gene(s) of the 
host cell that is altered in i) is one or more genes of a gluconate operon, 
preferably the gene{s) encodes a gluconate kinase (EC 2.7.1.12) or a 
gluconate permease or both, more preferably the gene(s) is homologous to one 

5 or more genes from Bacillus subtilis chosen from the group consisting of 
gntR, gntK, gntP, and gntZ, and most preferably the gene(s) is one or more 
genes of gntR, gntK, gntP, and grntZ from Bacillus licheniformis. 

25. A host cell comprising at least two copies of a gene of interest stably 
10 integrated into the chromosome, where each copy is integrated adjacent to 

different conditionally essential loci, wherein the cell is obtainable by 
any of the methods defined in claims 1-18. 

26. The cell of claim 25, wherein the gene of interest is separated from the 
15 conditionally essential locus by no more than 1000 basepairs, preferably no 

more than 750 basepairs, more preferably no more than 500 basepairs, even 
more preferably no more than 250 basepairs, and most preferably no more than 
100 basepairs. 

20 27. The cell of claims 25 or 26, which contains only endogenous DNA. 

28. The cell of any of claims 25-27, which is a Gram-positive bacterial 
cell, preferably a Bacillus cell, and most preferably a Bacillus cell chosen 
from the group consisting of Bacillus alkalophilus , Bacillus 
25 amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, 
Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus 
licheniformis, Bacillus megaterium, Bacillus stearothermophilus , Bacillus 
subtilis , and Bacillus thuringiensis. 

30 29. The cell of any of claims 25 - 27, wherein a copy of the gene of 

interest is integrated adjacent to a gene encoding a D-alanine racemase, 
preferably a gene homologous to the dal gene from Bacillus subtilis, more 
preferably a gene at least 75% identical to the dal sequence of Bacillus 
licheniformis shown in positions 1303 to 2469 in SEQ ID 12, even more 

35 preferably a gene at least 85% identical, more preferably at least 95% and 
most preferably at least 97% identical to the dal sequence of Bacillus 
licheniformis shown in positions 1303 to 2469 in SEQ ID 12 

30. The cell of any of claims 25 - 27, wherein a copy of the gene of 
40 interest is integrated adjacent to a gene of a xylose isomerase operon, 
preferably adjacent to genes homologous to the xylR or xylA genes from 
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Bacillus subtilis, and most preferably adjacent to xylR or xylA from 
Bacillus licheniformis . 



31. The cell of any of claims 25 - 27, wherein a copy of the gene of 
5 interest is integrated adjacent to a gene encoding a galactokinase (EC 

2.7.1.6), an UTP-dependent pyrophosphorylase (EC 2.7.7.10), an UDP-glucose- 
dependent uridylyltransf erase (EC 2.7.7.12), or an UDP-galactose epimerase 
(EC 5.1.2.3), preferably adjacent to a gene encoding an UDP-galactose 
epimerase (EC 5.1.2.3), more preferably adjacent to a gene homologous to the 
10 galE gene from Bacillus subtilis, and most preferably adjacent to galE from 
Bacillus licheniformis. 



32. The cell of any of claims 25 - 27, wherein a copy of the gene of 
interest is integrated adjacent to a gene of a gluconate operon, preferably 

15 adjacent to a gene that encodes a gluconate kinase (EC 2.7.1.12) or a 
gluconate permease, more preferably adjacent to a gene homologous to a 
Bacillus subtilis gene chosen from the group consisting of gntR, gntK, gntP, 
and gntZ, and most preferably adjacent to gntR, gntK, gntP, or gntZ from 
Bacillus licheniformis. 

20 

33. The cell of any of claims 25-32, wherein the gene of interest encodes 
an enzyme, preferably an amylolytic enzyme, a lipolytic enzyme, a 
proteolytic enzyme, a cellulytic enzyme, an oxidoreductase or a plant cell- 
wall degrading enzyme, and more preferably an enzyme with an activity 

25 selected from the group consisting of aminopeptidase, amylase, 

amyloglucosidase, carbohydrase, carboxypeptidase, catalase, cellulase, 
chitinase, cutinase, cyclodextrin glycosyltransf erase, deoxyribonuclease, 
esterase, galactosidase , beta-galactosidase, glucoamylase, glucose oxidase, 
glucosidase, haloperoxidase, hemicellulase, invertase, isomerase, laccase, 

30 ligase, lipase, lyase, mannosidase, oxidase, pectinase, peroxidase, phytase, 
phenoloxidase, polyphenoloxidase, protease, ribonuclease, transferase, 
transglutaminase, or xylanase. 

34. The cell of any of claims 25 - 33, wherein no antibiotic markers are 
35 present. 



35. A Bacillus licheniformis cell, wherein at least two conditionally 
essential genes are rendered non-f unctional , preferably the genes are chosen 
from the group consisting of xylR, xylA, galE, gntR, gntK, gntP, gntZ, and 
40 dal . 
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36. Use of a cell as defined in claim 35 in a method as defined in any of 
claims 1-18. 



37. A cell comprising a DNA construct as defined in claims 19-24. 

5 

38. A process for producing an enzyme of interest, comprising cultivating a 
cell as defined in any of claims 25-34 under conditions appropriate for 
producing the enzyme, and optionally purifying the enzyme. 
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Abstract 

The present invention solves the problem of integrating multiple 
copies of a gene of interest by homologous recombination into well defined 
positions adjacent to conditionally essential genes in a bacterial host 
5 strain chromosome, which already comprises at least one copy of the gene of 
interest in a different position. 
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SEQUENCE LISTING 



<110> Novo Nordisk A/S 

<120> Multiple insertion of genes 

<130> 10022. 000-DK 

<140> 
<141> 

<160> 25 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Primer #183235 
<400> 1 

gactaagctt ctgcatagtg agagaagacg 30 



<210> 2 
<211> 67 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
#183234. 

<400> 2 

gactgaattc agatctgcgg ccgcacgcgt gtcgacagta ctgaaataga ggaaaaaata €0 
agttttc 67 



<210> 3 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description f Artificial Sequence: Primer #183230 



1 



<400> 3 

gactgaattc cgtatccatt cctgcgatat gag 



33 



<210> 4 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
#183227 

<400> 4 

gactggatcc agate ttatt acaaccctga tgaatttgtc g 41 

<210> 5 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer #183229 
<400> 5 

gactggatcc agatctgeta gcatcgatcc gcggctattt ccattgaaag cgattaattg 60 

<210> 6 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer #187338 
<400> 6 

tatttcccga gattctgtta tcgactcgct c 31 



<210> 7 

<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> 



2 



<223> Description of Artificial Sequence: Primer #187339 
<400> 7 

gttttcggcc gctgtccgtt cgtcttt 27 



<210> 8 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer #184733 
<400> 8 

gtgtgacgga taaggccgcc gtcattg 27 



<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequences Primer #184788 
<400> 9 

ctcttgtctc ggagcctgca ttttgggg 2 8 



<210> 10 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
#B1656D07 

<400> 10 

agcattattc ttcgaagtcg cattgg 26 



<210> 11 

<211> 45 

<212> DNA 

<213> Artificial 



Sequence 



3 



<220> 

<223> Description of Artificial Sequence: Primer 
#B1659F03 

<400> 11 

ttaagatctt ttttatacaa ataggcttaa caataaagta aatcc 45 



<210> 12 
<211> 3342 
<212> DNA 

<213> Bacillus lichenif ormis 

<220> 
<221> CDS 

<222> (1303) . . (2469) 

<223> DNA sequence of the dal-gene encoding D- alanine 
racemase 

<400> 12 

gcgtaccgtt aaagtcgaac agcggtttct tcctttttac atccatggat taaaaagggg 60 
ttgaaaaaag gtgagaaaaa gctttgtttt gcttttaacg gggctgcatg taatccttat 120 
gctttctgcc tgcggccaaa aatcgcaaga agatgttgtg acggggctcg acaagaaggc 180 
aaaagaatac acgtcctata aggcaaaagc gaaaatgacc attgaaacgg ggaatgaccc 240 
gcaggagtac aacgtggaaa tctggcataa aaaaccttct ctttaccggg tctatttgga 300 
aaacccgaaa aaagaccaga gccaggtgat cttgcgcaat gaaaacggcg tgtttgtttt 360 
gactccgtcg ctgaataaaa gcttccgctt tcacagcgac tggcccaata acagcagcca 420 
ggtatactta ttcgaatcgc tcgtaaagga tgtcaaaaat gatggggaag cttctttttc 480 
cgcaaaggat tcaaaataca tttttgaaac gaaaacgaat tatcagcata atcagatgct 540 
gccgactcag gaaatcgttt tccataaaaa gaccatggct ccttcatcgg ttaaagtgat 600 
ggataccgac cgcaaaccga tggtaaaggt tgagtttaca agctttgaat tcgataagcc 660 
gctcgataaa gactcttttg atgaaaagaa aaatatgacg ctgtctcaaa ttgacgtagc 720 
gacaagcgct gacgtgtcag actctttcgc tgtcaaaacg ccgctcgatg tgcctcaggg 780 
cgtgaaaaag cttgaagaga aagagatggc gactgaagac ggcaaacgga tcgtcatcac 840 



4 



atatggcggt gaaaaatcct ttacattgat tcaggaaaaa gcccgcgtcg ccaaaacatc 900 



cacttccgta tccatgaacg gagagcccgt tgacctcggc ttcacggtcg gcgcactgac 960 

ggataaatcg ttgtcatgga catatgacgg agtcgattac tttatctcat cagaagatct 1020 

ttctcaagat gaacttctga tggttgcaaa aagcatgcag ggacagtctt cgaaatagac 1080 

tgtgccgtat ccggcagcct gttttccgcc cggaagcgga aagcaggctt ttttatattt 1140 

gcgtcgcaag cgtatgattt cgacagcttt tccgtaaaat gtataccgtg ccagcaattt 1200 

ttcttttgtt cagggcbgat gatcccgtgc aaaatttccc tttctccgaa ctttttagta 1260 

tgatgggaag gacgagtgaa acaaggaaca ggaagtgtca tg atg age tta aaa 1314 

Met Ser Leu Lys 
1 

cca ttc tat aga aag aca tgg gec gaa ate gat tta acg get tta aaa 1362 
Pro Phe Tyr Arg Lys Thr Trp Ala Glu lie Asp Leu Thr Ala Leu Lys 
5 10 15 20 

gaa aac gtc cgc aat atg aag egg cac ate ggc gag cat gtc cgc ctg 1410 
Glu Asn Val Arg Asn Met Lys Arg His He Gly Glu His Val Arg Leu 
25 30 35 

atg gee gtc gtt aaa gcg aat gee tac gga cac ggg gat gca cag gta 1458 
Met Ala Val Val Lys Ala Asn Ala Tyr Gly His Gly Asp Ala Gin Val 
40 45 50 

gcg aag gcg get ctt gca gaa ggg gcg tec att ctt get gtg get tta 1506 
Ala Lys Ala Ala Leu Ala Glu Gly Ala Ser lie Leu Ala Val Ala Leu 
55 60 65 

ttg gat gaa gcg ctt teg ctg agg gcg cag ggg att gaa gaa ccg att 1554 
Leu Asp Glu Ala Leu Ser Leu Arg Ala Gin Gly lie Glu Glu Pro He 
70 75 80 

ctt gtc etc ggt gca gtg ccg ace gaa tat gca age att gee gcg gaa 1602 
Leu Val Leu Gly Ala Val Pro Thr Glu Tyr Ala Ser lie Ala Ala Glu 
85 90 95 100 

aag cgc att ate gtg act ggc tac tec gtc ggc tgg ctg aaa gac gtg 1650 
Lys Arg lie lie Val Thr Gly Tyr Ser Val Gly Trp Leu Lys Asp Val 
105 110 115 

etc ggt ttt ctg aat gag gec gaa get cct ctt gaa tat cat ttg aag 1698 
Leu Gly Phe Leu Asn Glu Ala Glu Ala Pro Leu Glu Tyr His L u Lys 
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12 0 



125 



130 



ate gac acg ggc atg ggc cgc ctt ggc tgc aaa acg gaa gaa gag ate 1746 
lie Asp Thr Gly Met Gly Arg Lou Gly Cys Lys Thr Glu Glu Glu II 
135 140 145 

aaa gaa atg atg gag atg acc gaa teg aac gat aag etc aat tgt acg 1794 
Lys Glu Met Met Glu Met Thr Glu Ser Asn Asp Lys Leu Asn Cys Thr 
150 155 160 

ggc gtg ttc act cat ttc gec acg gcg gac gaa aag gac acc gat tat 1842 
Gly Val Phe Thr His Phe Ala Thr Ala Asp Glu Lys Asp Thr Asp Tyr 
165 170 175 180 

ttc aac atg cat ctt gac cgc ttt aaa gag ctg ate age ccc ttc ccg 1890 
Phe Asn Met His Leu Asp Arg Phe Lys Glu Leu lie Ser Pro Phe Pro 
185 190 195 

ctt gac cgt ttg atg gtg cat teg tea aac age gee gcg ggt ctg cgc 1938 
Leu Asp Arg Leu Met Val His Ser Ser Asn Ser Ala Ala Gly Leu Arg 
200 205 210 

ttc agg gaa cag eta ttt aat gec gtc cgc ttc ggc ate ggc atg tac 1986 
Phe Arg Glu Gin Leu Phe Asn Ala Val Arg Phe Gly lie Gly Met Tyr 
215 220 225 

ggt ttg gcg ccg tea acc gaa ata aaa gac gag ctg ccg ttt cgt ctg 2034 
Gly Leu Ala Pro Ser Thr Glu lie Lys Asp Glu Leu Pro Phe Arg Leu 
230 235 240 

egg gaa gtg ttt teg ctt cat acc gaa etc acc cat gtg aaa aaa att 2082 
Arg Glu Val Phe Ser Leu His Thr Glu Leu Thr His Val Lys Lys lie 
245 250 255 260 

aaa aaa ggc gag age gtc age tac ggg gcg aca tat aca get cag cgc 2130 
Lys Lys Gly Glu Ser Val Ser Tyr Gly Ala Thr Tyr Thr Ala Gin Arg 
265 270 275 

gac gaa tgg ate ggg aca gtc ccc gtc ggg tat gec gac gga tgg ctg 2178 
Asp Glu Trp lie Gly Thr Val Pro Val Gly Tyr Ala Asp Gly Trp Leu 
280 285 290 

agg cgc ctg gee gga acg gaa gtg ctg ate gac gga aaa cgc caa aaa 2226 
Arg Arg Leu Ala Gly Thr Glu Val Leu lie Asp Gly Lys Arg Gin Lys 
295 300 305 

ata gca ggg aga ate tgc atg gac cag ttc atg att tec ctt gee gaa 2274 
lie Ala Gly Arg He Cys Met Asp Gin Phe Met He Ser Leu Ala Glu 
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310 315 320 

gaa tac cct gtc ggc aca aag gtt acc ttg ate gga aag caa aaa gac 2322 
Glu Tyr Pro Val Gly Thr Lys Val Thr Leu II Gly Lys Gin Lys Asp 
325 330 335 340 

gaa tgg ate tea gtc gac gaa ate gee caa aat ttg cag acg ate aat 237 0 
Glu Trp lie Ser Val Asp Glu lie Ala Gin Asn Leu Gin Thr lie Asn 
345 350 355 

tat gaa att acc tgt atg ata agt tea agg gtg ccc cgt atg ttt ttg 2418 
Tyr Glu lie Thr Cys Met Xle Ser Ser Arg Val Pro Arg Met Phe Leu 
360 365 370 

gaa aat ggg agt ata atg gaa ata agg aat ccg ate ttg cct gat caa 2466 
Glu Asn Gly Ser lie Met Glu lie Arg Asn Pro Xle Leu Pro Asp Gin 
375 380 385 

tec tgaaaattga tgaattagcg gaaaaacaac tttgcttgcg aaaagaataa 2519 
Ser 

tgatatgatt atgaatggaa tggatagagt gttgtatccg taagtttggt ggaggtgtat 2 579 
gtttttgtct gaatccagcg caacaactga aatattgatt cgcttgccag aagctttagt 2639 
atcagaactg gatggtgtcg teatgegaga taacegggag cagganatga actgatttta 2 699 
ccaagccaca aaaatgtagg aacgegaacg caaaaaatcg acaaattegg ggaatcgatg 2759 
agaageggtt atatggagat ggecaagate caatttgaac atctcttctg aggctcaatt 2819 
tgcagagtat gaggctgaaa acacagtaga gegcttacta ageggatgat aatcatttga 2 87 9 
ttgttaaacg cggcgatgtt tattttgetg acetate tec tgttgttggc tcagaacaag 2 939 
gcggggtgcg cccggtttta gtgattcaaa acaacategg caatcgcttc agcccaactg 2 999 
ctattgttgc agecataaca gcccaaatac agaaagcaaa attacctacc caegtcgaaa 3 059 
ttgatgcgaa aegctaeggt tttgaaagag actcegttat attgetcgaa caaattegga 3119 
cgattgacaa gcaaagatta aeggacaaaa tcacccatct cgatgatgaa atgatggaaa 317 9 
aggtcaacga agecttacaa atcagtttgg cactcattga tttttaatat tgatgaaagt 323 9 
tgetcgagge gaaagagcaa ctttttttgt gttcaaaaat aacaatacga tataatggta 3299 
actgttagtc ctaaaaatgt tagecagatg tagtcagggg gat 3 342 
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<210> 13 
<211> 389 
<212> PRT 

<213> Bacillus licheni f ormis 
<400> 13 

Met Ser Leu Lys Pro Phe Tyr Arg Lys Thr Trp Ala Glu lie Asp Leu 
15 10 15 

Thr Ala Leu Lys Glu Asn Val Arg Asn Met Lys Arg His lie Gly Glu 
20 25 30 

His Val Arg Leu Met Ala Val Val Lys Ala Asn Ala Tyr Gly His Gly 
35 40 45 

Asp Ala Gin Val Ala Lys Ala Ala Leu Ala Glu Gly Ala Ser lie Leu 
50 55 60 

Ala Val Ala Leu Leu Asp Glu Ala Leu Ser Leu Arg Ala Gin Gly lie 
65 70 75 80 

Glu Glu Pro lie Leu Val Leu Gly Ala Val Pro Thr Glu Tyr Ala Ser 
85 90 95 

lie Ala Ala Glu Lys Arg lie lie Val Thr Gly Tyr Ser Val Gly Trp 
100 105 110 

Leu Lys Asp Val Leu Gly Phe Leu Asn Glu Ala Glu Ala Pro Leu Glu 
115 120 125 

Tyr His Leu Lys lie Asp Thr Gly Met Gly Arg Leu Gly Cys Lys Thr 
130 135 140 

Glu Glu Glu lie Lys Glu Met Met Glu Met Thr Glu Ser Asn Asp Lys 
145 150 155 160 

Leu Asn Cys Thr Gly Val Phe Thr His Phe Ala Thr Ala Asp Glu Lys 
165 170 175 

Asp Thr Asp Tyr Phe Asn Met His Leu Asp Arg Phe Lys Glu Leu He 
180 185 190 

Ser Pro Phe Pro Leu Asp Arg Leu Met Val His Ser Ser Asn Ser Ala 
195 200 205 

Ala Gly Leu Arg Phe Arg Glu Gin Leu Phe Asn Ala Val Arg Phe Gly 
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210 

lie Gly Mot Tyr Gly 
225 

Pro Phe Arg Leu Arg 
245 

Val Lys Lys lie Lye 
260 

Thr Ala Gin Arg Asp 

275 

Asp Gly Trp Leu Arg 
290 

Lys Arg Gin Lys lie 
305 

Ser Leu Ala Glu Glu 
325 

Lys Gin Lys Asp Glu 
340 

Gin Thr lie Asn Tyr 
355 

Arg Met Phe Leu Glu 
370 

Leu Pro Asp Gin Ser 
385 



215 

Leu Ala Pro Ser Thr Glu 
230 235 

Glu Val Phe Ser Leu His 
250 

Lys Gly Glu Ser Val Ser 
265 

Glu Trp lie Gly Thr Val 
280 

Arg Leu Ala Gly Thr Glu 
295 

Ala Gly Arg lie Cys Met 
310 315 

Tyr Pro Val Gly Thr Lys 
330 

Trp lie Ser Val Asp Glu 
345 

Glu lie Thr Cys Met lie 
360 

Asn Gly Ser lie Met Glu 
375 



220 

lie Lys Asp Glu Leu 
240 

Thr Glu Leu Thr His 
255 

Tyr Gly Ala Thr Tyr 
270 

Pro Val Gly Tyr Ala 
285 

Val Leu Tie Asp Gly 
300 

Asp Gin Phe Met lie 
320 

Val Thr Leu lie Gly 
335 

lie Ala Gin Asn Leu 
350 

Ser Ser Arg Val Pro 
365 

lie Arg Asn Pro lie 
380 



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer # 
14877 9; [Upstream of a SphI site in the dal 
regi n] 

<400> 14 



9 



gatgaacttc tgatggttgc 



20 



<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer # 
148780: [BamHI < dal] 

<400> 15 

aaaggatccc cctgactaca tctggc 26 



<210> 16 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer # 
170046 [NotI; < erxnC gene and the replication 
origin of pE194>] 

<400> 16 

aaagcggccg cgagactgtg acggatgaat tgaaaaagc 39 



<210> 17 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer # 

170047 [EcoRI; < - ermC gene and the replication 
origin of pE194->] 

<400> 17 

aaagaattcg tgaaatcagc tggactaaaa gg 32 



<210> 18 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer # 
150506 [BamHI; < dal gene] 

<400> 18 

aaaggatccc gcaagcaaag ttgtttttcc gc 

<210> 19 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer # 
150 507 [KpnX; <- dal gene] 

<400> 19 

aaaggtaccg aaagacatgg gccgaaatcg 

<210> 20 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer # 

158089 [KpnX; < Tl and T2 Terminators of rrnB] 

<400> 20 

aaaggtaccg gtaatgactc tctagcttga gg 

<210> 21 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer # 

158090 [Clal; < Tl and T2 Terminators of rrnB] 

<400> 21 

caaatcgatc atcaccgaaa cgcggcaggc age 
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<210> 22 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer # 

150508 [Hindll I; < DPS] 

<400> 22 

attaagcttg atatgattat gaatggaatg g 

<210> 23 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer # 

150509 [Nhel; < DPS] 

<400> 23 

aaagctagca tccccctgac tacatctggc 

<210> 24 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer # 
145507 [ < ydcC - dal - DPS >] 

<400> 24 

gcgtaccgtt aaagtcgaac agcg 

<210> 25 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Descripti n f Artificial Sequence: Primer # 
150509 [Nhel; < ydcC - dal - DPS >] 
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<400> 25 

aaagctagca tccccctgac tacatctggc 
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